home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
TeX 1995 July
/
TeX CD-ROM July 1995 (Disc 1)(Walnut Creek)(1995).ISO
/
dviware
/
screenview
/
vms
/
crudetype.web
(
.txt
)
< prev
next >
Wrap
LaTeX Document
|
1994-04-24
|
118KB
|
2,804 lines
% ADAPTED FROM DVITYPE, VERSION 2.6.
% REVISIONS:
% 9/86: clarify names of global variables, supply hooks for attempted
% Hewlett-Packard Laserjet version.
% 1/88: Several bugfixes. Chiefly the noscheme bug (TFM files without coding
% schemes)
% Also, added some MATH EXTENSION character codes.
% 4/88: Unix change file by P. King.
% 6/88: Changes to produce a "normal" VMS text file by Andrew Trevorrow.
% All such changes are flagged by AKT comments.
% Here is TeX material that gets inserted after \input webmac
\def\hang{\hangindent 3em\indent\ignorespaces}
\font\ninerm=cmr9
\let\mc=\ninerm % medium caps for names like PASCAL
\def\PASCAL{{\mc PASCAL}}
\def\(#1){} % this is used to make section names sort themselves better
\def\9#1{} % this is used for sort keys in the index
\def\title{Crudetype}
\def\contentspagenumber{1}
\def\topofcontents{\null
\def\titlepage{F} % include headline on the contents page
\def\rheader{\mainfont\hfil \contentspagenumber}
\vfill
\centerline{\titlefont Crudetype}
\vskip 50pt
\centerline{An adaptable device driver (Version 1, 1988)}
\vskip 50pt
\centerline{R.M.Damerell,} \vskip 30pt
\centerline{Mathematics Dept.,} \vskip 20pt
\centerline{Royal Holloway and Bedford College,} \vskip 20pt
\centerline{Egham, Surrey, U.K.} \vskip 20pt
\vfill}
\pageno=\contentspagenumber \advance\pageno by 1
% These macros for verbatim scanning are copied from MANMAC.TEX. But we cant
% use the vertical bar for a temporary escape character as WEAVE catches it.
% So we will use ! instead and hope for the best
\chardef\other=12
\def\ttverbatim{\begingroup
\catcode`\\=\other \catcode`\{=\other \catcode`\}=\other \catcode`\$=\other
\catcode`\&=\other \catcode`\#=\other \catcode`\%=\other \catcode`\~=\other
\catcode`\_=\other \catcode`\^=\other
\obeyspaces \obeylines \tt}
\outer\def\begintt{$$\let\par=\endgraf \ttverbatim \parskip=0pt
\catcode`\!=0 \rightskip-5pc \ttfinish}
{\catcode`\!=0 !catcode`!\=\other % ! is temporary escape character
!obeylines !obeyspaces % end of line is active
!gdef!ttfinish#1^^M#2\endtt{#1!vbox{#2}!endgroup$$}}
\def\up{\hbox{\tt{\char'013}}}
\def\markarrow#1{\vtop{\hbox{#1}\up}}
@* Introduction.
COPYRIGHT ( C ) R.M.Damerell, 1988.
Permission is given to any person to make and distribute copies of this
software, subject to the following conditions:
1. All copies of the software must carry an exact copy of this notice.
2. This software is distributed free of charge, "AS IS" with absolutely no
guarantee of performance. Any persons receiving or using this software must do
so entirely at their own risk. Neither the authors nor their institutions
accept any liability for any defects of this software, or for any consequential
loss or damage however caused.
3. Any person who changes this software must clearly mark it as modified and
add a note describing the changes made.
This is an experimental version and no guarantee of performance is given.
I would like to receive bug reports, same address or electronic mail to
DAMERELL at ARPA.UCL.CS.NSS. \par\vskip 0.5in
This program was originally based on D.E.Knuth's program \.{DVItype}, but so
many changes were needed for various reasons that there is hardly any of the
original code left. The purpose of this program is to provide a framework for
users to write \TeX\ device drivers for a variety of `crude' devices. Roughly
speaking, `crude' means any printer that cannot print the fonts that Metafont
generates. This would include daisy-wheels and most impact dot-matrix
printers. Considered as output printers for \TeX, such devices usually have
some of the following misfeatures: \item
1. Coarse resolution.\item
2. Restricted character set. \item
3. Some printers cannot do reverse line feeds, some can, and tear the paper.
\item
4. Slow interface between CPU and printer.\par
Although such printers cannot do justice to \TeX\ output, drivers for them
are still needed. Some users cannot afford high quality printers. Some can
only afford to use them for final output; so they need to make proofs on a
cheaper printer. Also, anybody who has a high quality printer may well need
to refer to various \.{WEB} files while writing a driver for it. These can
become illegible in critical places. Here is a sample from \.{DVItype}:
\begintt
A |fix_word| whose respective bytes are $(a,b,c,d)$ represents the number
$$x=\left\{\vcenter{\halign{$#$,\hfil\qquad&if $#$\hfil\cr
b\cdot2^{-4}+c\cdot2^{-12}+d\cdot2^{-20}&a=0;\cr
-16+b\cdot2^{-4}+c\cdot2^{-12}+d\cdot2^{-20}&a=255.\cr}}\right.$$
\endtt
Using the basic (line printer) version of \.{Crudetype}, we can get a copy of
these formulae which is at least legible, even though the result may not be at
all pleasant to look at. A further difficulty with conventional drivers is
that most of these use the algorithm `paint a page of pixels, send it down the
line'. This places a heavy load on both the host computer and the link to the
printer. Of course, one can try to reduce this load by various optimisations,
(e.g. by writing critical bits of code in machine language) but this makes the
program non-portable, and often introduces bugs. \.{Crudetype} is written
entirely in \PASCAL, without any attempt at optimisation. When compiled on a
VAX 780 with the NO-OPTIMISE, CHECK and DEBUG qualifiers it runs at about 2--3
seconds a page. These times are highly variable, and the VMS optimiser reduces
them by about 10-15\%.
@ Printers vary enormously both in their capabilities and in the commands that
drive them. The behaviour of \.{Crudetype} is controlled by a large number of
constants, which supposedly describe how the target printer does things. This
does have the disadvantage that the user must compile a separate copy of the
program for each different printer, and also devise some way to ensure that he
uses the right version for the intended printer. But the only alternative
seemed to be that \.{Crudetype} should read and parse a file describing the
printer and this appeared to be unbearably messy. Ideally, these constants
should be so designed that: \item
1. Any decent printer can be driven by assigning the right values to these
constants and recompiling. \item
2. If the printer is properly documented, it should be immediately obvious
what are the correct values for all these constants.
At present I do not have enough experience of different printers to come near
this ideal. In particular, some printers can download characters. The
problems of writing a program to support this facility in proper generality
are horrible and ghastly. I have not made any serious attempt yet to tackle
them. There are just a few places where a hook appears, and I hope eventually
to attach actual routines for downloading.
Some of the more obvious problems of downloading are: when can you download?
At any time? start of page? or only at start of document? Can you load one
character, or must you load a whole font at a time? How much memory does the
printer provide for down loading? How efficiently does it use its memory? What
does it do when it runs out? Can you clear out old fonts to make more space?
What is the format of a down-load command? What parameters does it need, in
what order, with what punctuation? In what order must pixels be sent? Should
they be compressed, and how?
@* Implementation.
The original version of \.{Crudetype} was aimed at a line printer, (because
everybody has these) and was written on the VAX-VMS operating system. The
intention is that this program should be easily adaptable both to other
systems and to other printers. So most of it is written in Standard \PASCAL.
(It is not possible to tell exactly how much of it is Standard, as we do not
have a certified compiler.) But in some places, it is necessary to use
extensions. In particular, \.{Crudetype} must read the font files, whose names
are dynamically specified. That would be impossible in pure \PASCAL.
\.{Crudetype} also uses non-Standard code in order to talk to the user's
terminal. It asks for the name of the \.{DVI} file, and for the first page and
the number of pages to print. If an operating system forbids terminal
interaction, the installer will have to find another way to give the program
this information. As file handling is inevitably system-dependent, I have here
allowed myself a lot of latitude in using VMS-specific procedures. If
\.{Crudetype} cannot find a file, it will ask the user for another name. On
the other hand, all files are read and written sequentially, and I have got
rid of all uses of the default |case| statement. The intention is that all the
system-dependent stuff goes near the top of the file, and all
printer-dependent stuff at the end. Then with any luck you can merely
concatenate Change files for the local system and the local printer, instead
of having to merge them. All the code that is known to be non-Standard has
been carefully segregated from the rest of the program. It amounts to about 20
lines out of 750.
@.System dependencies@>
It is clearly impossible to predict what difficulties will appear in trying
to install \.{Crudetype} on other systems, it would seem to be advisable to
get the line printer version working before trying to adapt it for any other
printer. To try to ease the process, I propose to distribute several test
files with the program. These are of the form SAMPLE.TEX, SAMPLE.DVI and
SAMPLE.PRI (the line printer output).
Although `crude' printers differ very much in their capacities, one thing
they nearly all have in common is that they cannot feed the paper backwards.
Some printers cant |Backfeed| at all; some tear the paper, and others let the
paper slip and so lose position. Therefore it seems to be essential to process
each page as follows: first copy the page into a suitable structure, then sort
it by vertical and horizontal position, then print it.
@* Main Program.
@d banner=='This is Crudetype, Version 1, copyright, experimental'
{printed when the program starts}
@p program crudetype
@<Declarations@>
begin
@<Initialize@>;
read_BOP; {AKT: was at start of For each page...}
repeat
@<For each page of \.{DVI}, print it if desired@>
until time_to_stop ;
@<Clean up afterwards@>;
666: {AKT: come here instead of crashing!}
end.
@ Now here are some of the messy things we must do to satisfy the rules of
\PASCAL.
@<Declarations @>=
(@<Files @>) ;
label 666; {AKT}
const @< Constants in the outer block @>
type @< Types in the outer block @>
var @< Globals in the outer block @>
@<Forward declarations @>
@<Lowest level procedures @>
@<Medium level procedures @>
@<Top level procedures@>
@ @<Initialize@>=
@<Set initial values@>
@<Open terminal channels@>
@<Assign character codes@>
@<Dialog with user@>
@<Open \.{DVI} file@>
@<Open |printfile|@>
@<Read \.{DVI} preamble @>
@ @<Glob...@>=
in_i, in_j :integer; {loop index for initializations}
@ Next, here are some macros for common programming idioms.
@d incr(#) == #:=#+1 {increase a variable by unity}
@d decr(#) == #:=#-1 {decrease a variable by unity}
@d do_nothing == {empty statement}
@d exit == 732
@d return == goto exit
{Go here when a loop ends abnormally}
@ The next two procedures are very primitive debugging aids. All internally
detected errors call |error|. Then they can be caught (in VMS) by the debugger
command\begintt
DBG>set break error \endtt
If a fatal error occurs, then the program will force a crash. With the VMS
debugger, you can then interrogate variables, etc. I chose the square root of
$-1$ as this does not figure prominently in \TeX-related programs.
@^square root@>
@<Forward...@>=
procedure error ; begin end;
procedure crash;
var u: real;
begin
goto 666; {AKT: was
u := -1 ;
error;
u := sqrt(u) ;
}
@* Interface to Operating System, 1: Material specific to VAX/VMS.
The purpose of these sections is to try to give a reasonable interface between
the operating system and the rest of the program, which is supposed to be
Standard \PASCAL. Nearly all the non-Standard code is concerned with file
handling and the lowest level of I/O. This is an area where Standard \PASCAL\
seems to be particularly weak. This particular section contains all of the
most system dependent code, and it will probably have to be entirely rewritten
for any other machine. It is hoped that most of the later sections will work
on a wide range of machines. Everything here is system dependent, so there is
no point in indexing each module separately.
@.System dependencies@>
@ The character set. I have here deleted all the code from \.{DVItype} that
translates from characters to small integers and back. This is because we have
to do a quite different translation anyway. If it is necessary to put that
code back in, then it will probably be necessary to insert \begintt
define zchr(#) == xchr[#] \endtt
because of the different brackets.
@d zchr == chr
@d zord == ord
@ Here are some other system-dependent types. We use double length arithmetic.
The VMS-specific function |dble| converts its argument to double precision.
Integers are 32 bits in VMS. Normally, I use |integer| whenever the bit length
is unimportant, but I use subranges in the |page_record| type, as this allows
packing and may improve the program's performance.
@d real_num == double
@d make_double == dble {convert a |real| to double length}
@d max_half = 32767
@<Types...@>=
byte = 0..255 ;
i_word = -max_half-1 .. max_half ;
@ Here we consider the lowest level of file handling. The main input file is
the |dvi_file|. Output for printing goes to |printfile| and diagnostic output
to |term_out|. The terms |display| and |print| are used instead of |write| so
that output may be redirected if desired. Throughout the program, it is
assumed that |@!write| appends its argument(s) to the current record of the
selected file, and |@!write_ln| ends that record and sends it off; this
behaviour is specified in the \PASCAL\ Standard. If these assumptions are
false, it will require major restructuring of the program.
These macros describe how we use the terminal. VMS actually opens the terminal
channels for us, but we get a better style of output by re-opening it, and
using these funny-looking macros to write to it. We can then print a stream of
progress reports without falling foul of a finite record length.
@d term_in==input {the terminal, considered as an input file}
@d term_out==output {and output}
@d i_reset_terminal == do_nothing {Switch terminal to input}
@d o_rewrite_terminal == do_nothing
{and back to output. VMS does all this automatically}
@d display(#)==write_ln(term_out, #)
@d display_ln(#)==write_ln(term_out, #, chr(13), chr(10))
@d c_con == @=carriage_control@>
@d warn (#)==begin display_ln('Error: ', #); error; end
@d abort(#)==begin display_ln('Fatal: ', #); crash; end
@d bad_dvi(#)==abort('Bad DVI file: ',# )
@<Files @>= term_in, term_out
@ @<Open terminal channels@>=
open(term_out, 'SYS$OUTPUT', c_con := none) ;
@ @<Const...@>= can_interact = true ;
@ The printed output goes to |@!printfile|.
@d print(#)==write(printfile, #)
@d print_ln ==write_ln(printfile )
@<Files@>= , printfile
@ @<Glob...@>= printfile : text ;
print_name: var_string ;
@ In this section we generate a name for the printed file. Essentially, this
involves deleting the `.DVI' at the end of the \.{DVI} filename and adding
`.PRI' instead. But: the \.{DVI} file might be in a funny place, and it might
have a funny extension. Of course, this code all depends crucially on the VMS
file name format, and it will probably be a lot more complicated on systems
that do not allow the elaborate facilities of the VMS |open| command.
@<Open |printfile|@>=
print_name := dvi_name ;
chop_tail(print_name, ':') ;
chop_tail(print_name, ']') ;
{Chop off directory, disc, and perhaps a logical name}
chop_top(print_name, '.') ;
chop_top(print_name, ';') ;
{and extension and version number}
append(print_name, print_end) ; {In VMS, usually `.PRI'}
display ('Output is '); {AKT: was PRINTFILE IS}
string_show(print_name);
display_ln(' ') ;
@.Printfile is...@>
@ VMS \PASCAL\ allows 3 types of carriage control, called |list|, |fortran|,
or |none|. No doubt other systems will have other peculiar types of carriage
control. In VMS, |none| is to be used if at all possible, but some printers
insist on a line feed after every carriage return. Roughly speaking, |@!list|
directs the operating system to put a CR--LF at the end of each record when the
file is printed. |@!fortran| means that a Fortran-type carriage control
character must be put at the start of each record, and \.{Crudetype} assumes
that this must be inserted explicitly. One type of run-time error that causes
a lot of trouble occurs if you try to write too many characters onto one
record of the |printfile|. I have tried to defeat this by declaring a very
long record length.
@ @d r_len == @=record_length@>
@<Open |printfile|@>=
if fortran then
open (printfile, print_name.data, c_con := fortran, r_len := 30000 )
else if list then
open(printfile, print_name.data, c_con := list, r_len := 30000 )
else
open(printfile, print_name.data, c_con := none, r_len := 30000 ) ;
rewrite(printfile) ;
@ Now here is the lowest-level procedure for opening binary files. This will
have to be rewritten to run on any other system. The VMS |open| procedure
tries to open the file with the given name; if bits of the name are missing,
it can obtain them from the |default| parameter. It generates a non-zero
|status| if it fails.
@d close_binary(#)==
close(# , @=error := continue@> )
@<Lowest...@>=
function open_binary
(var f_f: byte_file; name: var_string; other_name: def_name_type): boolean;
var s: integer;
begin
close_binary(f_f );
{in case the file was left open}
open(f_f, name.data , readonly, ,, fixed, default := other_name,
@=error := continue@> );
s := status(f_f) ;
if s <> 0 then open_binary := false
else begin
reset(f_f , @=error := continue@> );
s := status(f_f) ;
open_binary := (s = 0 );
end;
end;
@ Here we define system-dependent properties of these files. The easiest way
to tell VMS where to look for files is by giving them default names. These
should all be the same length if possible.
@d block_length = 512
@<Const...@>=
dvi_def = ' *.DVI' ;
tfm_def = 'TEX$FONTS:*.TFM' ;
pxl_def = 'TEX$PIXLDIR:*.*' ;
@ @<Types...@>= def_name_type = packed array [1..15] of char ;
@ Here are macros for the adaptable merge sort. See the section on sorting for
explanation.
@d image(#) == pool[#]
@d create == incr(cell)
@d link_type == page_i
@d first_cell == cell := 0
@d wipe_out(#) ==
@d declare_pool == pool: array [page_i] of page_record;
@d garbage == cell := zzz ;
@ These upper bounds are put in to catch runaway arguments.
@<Const...@>=
page_max = 10000 ;
max_line_size = 1000;
@* Interface to Operating System, 2: Terminal input and output.
When \.{DVItype} begins, it engages the user in a brief dialog so that the
options will be specified. This version of \.{Crudetype} does the same. This
requires nonstandard \PASCAL\ constructions to handle the online interaction.
So it may be necessary on some systems to omit the dialog. If so, the
installer must find some way to get the \.{DVI} file name into the |@!buffer|.
@<Dialog...@>=
{AKT: removed display_ln(banner, ' --- ', device_ID) ;}
buffer := blank ;
repeat
ask_prompt('DVI file name? ');
dvi_name := buffer ;
until dvi_name.len > 0 ;
ask_prompt('First page? (default = 0) ' );
first_page := get_number(0 ) ;
ask_prompt('maximum no. of pages? (default = 1000000) ' ) ;
max_pages := get_number(1000000) ;
@.DVI file name?@>@.First page?@>@.max. no. of pages?@>
@ Most characters in \TeX\ fonts are narrower than line-printer characters. So
we must spread them out to make them fit. Originally, this was done by
multiplying \.{DVI} distances by a constant factor |h_fudge|. This is all
right for one size of type but it tends to fail for other sizes because if the
predominant type size is larger than expected, then rounding with a constant
factor makes everything\qquad\ very\qquad\ badly\qquad\ spread\qquad\ out. It
seemed that the least bad way to tackle this is to allow the user to specify
an extra magnification factor.
@<Dialog...@>=
{AKT: removed
if can_interact then display_ln(
'What magnification? This must be an integer, and is a percentage');
ask_prompt ( 'Default = 100% = DVI file magnification ') ;
extra_mag := get_number(100)/100.0 ;
@.What magnification?@>
@ Since the terminal is being used for both input and output, some systems
need a special routine to make sure that the user can see a prompt message
before waiting for input based on that message. (Otherwise the message may
just be sitting in a hidden buffer somewhere, and the user will have no idea
what the program is waiting for.) Here, we assume that the system-dependent
macro |@!i_reset_terminal| (defined above) will do whatever is necessary to
switch the terminal from output to input. Likewise, |@!o_rewrite_terminal|
must switch it from input to output. Note that the program assumes that the
terminal is normally in output mode, and explicitly calls these macros when it
wants input. If the system does not allow this, then |@!can_interact| should
be set false.
Here is how the program prompts for input: the argument of |ask_prompt| is the
prompt text. Because of the anomalous behaviour of |write|, this ought to work
with arguments of any length, even on versions of \PASCAL\ that only allow
fixed length strings.
@d ask_prompt(#) == if can_interact then begin
{AKT: removed display_ln(#) ;} read_terminal ; end;
@<Lowest...@>=
procedure read_terminal;
var k: byte ;
begin i_reset_terminal;
buffer := blank ;
if not eof(term_in ) then begin
if eoln(term_in) then read_ln(term_in);
k:=0;
while not eoln(term_in) do
begin incr(k); buffer.data[k]:=term_in^; get(term_in);
end;
buffer.len := k ;
upcase(buffer) ;
end;
o_rewrite_terminal ;
end;
@ @<Glob...@>=
@!buffer: var_string ; {for terminal input}
@!extra_mag:real_num ;
@ The next function reads an integer from the |buffer|. It assumes a previous
call of |ask_prompt| and returns the default if the input is unrecognisable.
BODGE: this cant handle negative numbers.
@<Lowest...@>=
function get_number(default: integer): integer;
var k, m : integer; c:byte ;
begin
k := 0 ;
repeat
incr(k) ;
c := zord(buffer.data[k] ) ;
until (k > buffer.len)
or ((c <> " " ) and (c <> "+" ) ) ;
if (k > buffer.len) or (c < "0") or (c > "9" )
then get_number := default
else
begin
m:=0;
while (c >="0") and (c <="9") do begin
m:=10*m+ c -"0"; incr(k);
c := zord(buffer.data[k] ) ;
end;
get_number := m ;
end;
end;
@ If the printer is actually a VDU, then possibly the user will want to pause
at intervals.
@<Check pause@>=
if can_interact and do_pause and (PR_v >= next_pause) then begin
display_ln(pause_ask);
i_reset_terminal;
read_ln (term_in );
o_rewrite_terminal ;
string_show(pause_after);
next_pause := next_pause + pause_steps ;
end;
@ @<Pause reset@>=
if do_pause then
begin next_pause := pause_steps; @<Check pause@> end;
@ @<Const...@>=
@<Pause constants, to be defined by the printer change file@>
@ @<Glob...@>=
next_pause: integer;
pause_after: var_string ;
@* Interface to Operating System, 3: Input from binary files.
The main input file is the \.{DVI} file. Logically, this is just a stream of
8-bit bytes, with no record or block structure. However VMS \PASCAL\
apparently cannot handle files of this type; so I have adopted the blocking
scheme (due to D.R.Fuchs) from the VMS \.{DVItype} change file. But a lot of
the code has been rewritten. Some other operating systems use similar
blocking schemes; so this code may possibly work without much change. The
program deals with two binary file variables: |@!dvi_file| is the main input
file that we are printing, and |@!tfm_file| the current font metric file from
which character-width information is being read. Each of these has a name and
a counter, declared here; also a default name (system dependent, and so
declared previously). As an initial attempt at downloading, we declare a
|@!pxl_file|.
@^Fuchs, D.R.@>
@<Files...@>= , dvi_file, tfm_file, pxl_file
@ @<Glob...@>=
dvi_file, tfm_file, pxl_file: byte_file ;
dvi_name, tfm_name, pxl_name: var_string ;
dvi_indx, tfm_indx, pxl_indx: integer ;
font_ok: boolean ;
@ @<Types...@>=
@!byte_block=packed array [0..block_length-1] of byte ;
@!byte_file= packed file of byte_block;
@ This code opens the \.{DVI} file; clearly, a failure is fatal.
@<Open \.{DVI} file@>=
{AKT: removed display_ln ('Opening DVI file ' );}
if not open_and_ask(dvi_file, dvi_indx, dvi_name, dvi_def)
then abort('Could not open DVI file!');
@.Fatal: couldnt open@>@.Opening DVI file@>
@ But when we come to open a font file, we merely report a failure:
@<Open font file@>=
font_ok := open_and_ask (tfm_file, tfm_indx, tfm_name, tfm_def) ;
@ Here is the procedure that actually opens files. It searches for a file
called |name|, supplying missing bits from the default file-specification in
|other_name|. |f_f| is the file being opened, and |f_c| is its counter.
@<Medium...@>=
function open_and_ask
(var f_f: byte_file; var f_c: integer; var name: var_string;
other_name: def_name_type ) : boolean ;
label exit ;
var success : boolean;
begin
success := false;
repeat
success := open_binary(f_f, name, other_name) ;
if success then f_c := 0
else
@<Try to get an alternative name, |return| if this fails@>
until success ;
exit: open_and_ask:= success ;
end;
@ If this fails, then ask the user for another name. If the operating system
forbids this, or if the user refuses, then return |false| to indicate failure.
@<Try to get...@>=
begin
return; {AKT: don't ask user for another name}
warn ('Couldnt open file, search name was, ' );
string_show(name) ;
display_ln (' ') ;
display_ln ('default name was ' , other_name );
if can_interact then begin
ask_prompt('Please type a replacement or NO to abandon search' ) ;
name := buffer ;
if (name.len = 2) and
(name.data[1] = 'N') and (name.data[2] = 'O')
then return;
end else return;
end ;
@.error: couldnt open@>@.Please type...@>
@ \.{DVItype} has seven functions for reading integers from the \.{DVI} file
and two more for the \.{TFM} file. I have condensed these. In order for
these procedures to work, they must all have as parameters, both the file and
its attached counter. These macros generate the procedure calls.
@d read_end(#) == # @=)@>
@d skip(#) == skip_bytes @=(@> # @& file, # @& indx, read_end
@d get_integer(#) == read_integer @=(@> # @& file, # @& indx, read_end
@d get_byte(#) == read_byte(# @& file, # @& indx)
@d get_real(#) == read_real(# @& file, # @& indx)
@<Lowest...@>=
function read_byte(var f_file: byte_file; var f_indx: integer) : byte;
begin
if eof(f_file) then
warn('fallen off end of file' )
@.error: fallen off...@>
else begin
read_byte := f_file^[f_indx] ;
incr(f_indx);
if f_indx =block_length then begin
get(f_file );
f_indx:=0;
end;
end;
end ;
procedure skip_bytes(var f_file: byte_file; var f_indx: integer; n:integer);
{discard n bytes from |f_file|}
begin
if n < 0 then abort('skip_bytes called with negative number');
f_indx := f_indx + n;
while f_indx >= block_length do
begin
if eof(f_file) then
warn('fallen off end of file' )
else get(f_file );
f_indx := f_indx - block_length ;
end ;
end;
@.error: fallen off...@> @.Fatal: skip_bytes called...@>
@ The next function reads an integer from a file. |k| specifies the type.
|abs(k)| is the number of bytes, and the integer will be signed if |k<0|.
@<Lowest...@>=
function read_integer
(var f_file: byte_file; var f_indx: integer; k: integer): integer;
var a, i : byte; n: integer;
begin n := get_byte(f );
if (k < 0) and (n > 127) then n := n-256 ;
for i := 1 to abs(k) - 1 do
begin
a := get_byte(f ) ;
n := n*256 + a ;
end ;
read_integer := n ;
end;
@ A real number is stored in the file as 2 integers, numerator first.
@<Medium...@>=
function read_real(var f_file: byte_file; var f_indx: integer ): real_num;
var a, b: integer;
begin a := get_integer(f )(-4);
b := get_integer(f )(-4);
if b <= 0 then
begin
warn('denominator <= 0! '); read_real:= 1;
end
else read_real:= make_double(a)/make_double(b) ;
end;
@.error: denominator...@>
@* Page selection.
We have now disposed of all the code that is known to be system-dependent, so
we can resume a proper top-down description of the program. The basic method
for processing each page is that all printable characters are written onto a
structure called a `page image'. This is a list of things called `page
records'. Each page record represents one printable character, and contains
two fields giving the intended position on the page. Eventually the image will
be sorted and then copied to the |printfile|. This means that \.{Crudetype}
has to remember three sets of coordinates. In order to help to keep track of
many global variables, we use prefixes. \.{DVI} variables are prefixed with
|D_|, page image variables with |IM_|, and the printer's variables with |PR_|.
When this module starts, the \.{DVI} file should be positioned at or before a
@<For each page...@>=
{AKT: moved first read_BOP before repeat loop}
if (counter[0] >= first_page) then start := true ;
{AKT: what if counter[0] is < 0???}
if start and (max_pages > 0 )
then begin
decr(max_pages);
display('[', counter[0]:1, ']' ); {Progress report}
Read_one_page ;
@<Sort the page@>
Send_page ;
{AKT: removed @<Formfeed@>;}
end
else if max_pages > 0 then Skip_page
else time_to_stop := true;
{AKT: only call Formfeed BETWEEN pages; this requires some hackery}
read_BOP; {sets max_pages to -1 if no more}
time_to_stop := max_pages <= 0;
if (not time_to_stop) and start then begin
@<Formfeed@>;
end;
@ This program only gives a small subset of the page-selection facilities of
\.{DVItype}. The most you can do is to specify the starting page and the
maximum number of pages to print. This will be controlled by these variables:
@<Glob...@>=
start, time_to_stop: boolean;
first_page, max_pages: integer;
counter: array[0..9] of integer ;
@ @<Set init...@>=
start := false ; time_to_stop := false;
for in_i := 0 to 9 do counter[ in_i ] := 0 ;
@ |@!D_com| is the \.{DVI} command byte, |@!D_par| its first parameter.
@<Top level...@>=
procedure Read_one_page ;
var D_com: byte; D_par: integer; end_page: boolean ;
begin end_page := false;
@<Set up an empty page image@>
repeat
@<Get \.{DVI} command |D_com|, and do it@>
until end_page;
end ;
@#procedure Skip_page ;
var D_com: byte; D_par: integer; end_page: boolean ;
begin
end_page := false;
repeat
@<Skip \.{DVI} command, but we must process any |font_def|@>
until end_page;
end ;
@* Translating the device-independent file, 1: The big switch.
Refer to \.{DVItype} or to \.{TUG}boat (Vol.3, No.2) for a description of the
\.{DVI} file format. As in \.{DVItype}, we process each \.{DVI} command via a
big |case| statement. But 192 of the cases are very similar, so lets dispose
of them first.
@.TUGboat@>
@d id_byte=2 {identifies the kind of \.{DVI} files described here}
@d move_right ==
D_h := D_h + D_dis ;
IM_h := IM_h + IM_dis
@<Get \.{DVI} command...@>=
D_com := get_byte(dvi);
if D_com < 128 then begin
set_character(D_com); move_right ;
end
else if (D_com >= 171) and (D_com <= 234) then
change_font(D_com - 171)
else
@ @<Skip \.{DVI} command...@>=
D_com := get_byte(dvi);
if (D_com < 128)
or ((D_com <= 234) and (D_com >= 171))
then do_nothing
else
@ Now we come to the |case| statement proper. This section of the program is
long and complicated, and I have tried to clean it up. Some commands want an
unsigned parameter, called |D_par|, to be read from the file. We use
|four_cases| for those. Others want a signed parameter; they are all
movements. We use |move_cases| for those.
@d four_case_end(#) == # ; end
@d four_cases(#)==
#,#+1,#+2,#+3: begin D_par := get_integer(dvi)( D_com - # + 1 );
four_case_end
@d move_cases(#)==
#,#+1,#+2,#+3: begin D_par := get_integer(dvi)( # - D_com - 1 );
four_case_end
@<Get \.{DVI} command...@>=
case D_com of
four_cases(128) (set_character(D_par); move_right );
132: begin set_rule; move_right ; end;
four_cases(133) (set_character(D_par) );
137: set_rule ;
138: do_nothing ;
140: end_page := true ;
141: push;
142: pop;
move_cases(143) (D_h := D_h+D_par);
147:{W0} D_h := D_h+D_w ;
move_cases(148) (D_w := D_par; D_h := D_h+D_w );
152:{X0} D_h := D_h+D_x ;
move_cases(153) (D_x := D_par; D_h := D_h+D_x );
move_cases(157) (move_down(D_par));
161:{Y0} move_down(D_y);
move_cases(162) (D_y := D_par; move_down(D_y) );
166:{Z0} move_down(D_z);
move_cases(167) (D_z := D_par; move_down(D_z) );
four_cases(235) (change_font(D_par) );
four_cases(243) (define_font(D_par) );
@<Fourteen illegal cases: print suitable error messages@>
end ;
@ When skipping a page, we must throw away parameters instead of using them.
@d four_throw(#) ==
#,#+1,#+2,#+3: skip(dvi)(D_com - # + 1 )
@<Skip \.{DVI} command...@>=
case D_com of
four_throw(128);
132, 137: skip(dvi)(8); {sizes of a rule}
four_throw(133);
138: ;
140: end_page := true ;
141,142: ;
four_throw(143);
147: ;
four_throw(148);
152: ;
four_throw(153);
four_throw(157);
161: ;
four_throw(162);
166: ;
four_throw(167);
four_throw(235);
four_cases(243) (define_font(D_par) );
@<Fourteen illegal...@>
end ;
@ Finally, there are 14 illegal values of |D_com| that generate various errors.
@<Fourteen illegal...@>=
four_cases(239)
({AKT: removed warn('ignoring \special') ;} skip(dvi)(D_par) );
139, 247, 248, 249:
bad_dvi('byte: ', D_com:1 , ' out of context inside page' ) ;
250,251,252,253,254,255:
bad_dvi('Illegal command byte, ', D_com ) ;
@.error: cant do xxx@>
@.Fatal: Bad DVI file@>
@* Translating the device-independent file, 2: Paging and the stack.
The definition of \.{DVI} files refers to six registers, (|D_h, D_v, D_w,
D_x, D_y, D_z|), which hold integer values in \.{DVI} units. We shall need
additional registers in order to calculate a rounded position. From time to
time, we save the current values of these on a stack, represented by the
following arrays.
@d max_stack = 100 {\.{DVI} files shouldn't |push| beyond this depth}
@<Glob...@>=
D_h,D_v,D_w,D_x,D_y,D_z : integer; {current \.{DVI} state values}
D_h_stack, D_v_stack, D_w_stack, D_x_stack, D_y_stack, D_z_stack:
array [0..max_stack+2] of integer; {pushed down values }
@!stack_ht: 0..max_stack; {current stack depth}
just_pushed: boolean;
@ @<Set up an empty page image@>=
D_h := 0 ; D_v := 0 ;
D_w := 0 ; D_x := 0 ;
D_y := 0 ; D_z := 0 ;
stack_ht := 0 ;
rail_base := 0 ;
just_pushed := false ;
@ Here is how \.{DVI}type manipulates the stack: The first |push| on a page
fills the zeroth place on the stack and sets |stack_ht| = 1. So the used
places are numbered |0..stack_ht- 1|. Now |push| and |pop| do the obvious
things.
@<Lowest...@>=
procedure push;
var x: real_num ;
begin if stack_ht=max_stack then
warn('Capacity exceeded (stack size=', max_stack:1,')')
else begin
D_h_stack[stack_ht]:=D_h; D_v_stack[stack_ht]:=D_v;
D_w_stack[stack_ht]:=D_w; D_x_stack[stack_ht]:=D_x;
D_y_stack[stack_ht]:=D_y; D_z_stack[stack_ht]:=D_z;
@<Some adjustments are needed here for rounding@>
incr(stack_ht); just_pushed := true ;
end;
end;
@.error: Capacity exceeded @>
@# procedure pop;
begin if stack_ht=0 then warn('POP illegal at level zero')
else begin
decr(stack_ht);
D_h:=D_h_stack[stack_ht]; D_v:=D_v_stack[stack_ht];
D_w:=D_w_stack[stack_ht]; D_x:=D_x_stack[stack_ht];
D_y:=D_y_stack[stack_ht]; D_z:=D_z_stack[stack_ht];
IM_h := IM_h_stack[stack_ht];IM_v := IM_v_stack[stack_ht];
@<Set |rail_base|@>
end;
end;
@.error: POP illegal...@>
@ This procedure gets called when we expect to read a new page. It looks for
the next |BOP|; if it finds the postamble instead, it sets |max_pages < 0| as
a signal.
@d POST = 248
@d NOP = 138
@d BOP = 139
@<Top level...@>=
procedure read_BOP;
var k: byte ; D_par:integer ;
begin
repeat k:= get_byte(dvi);
if (k>= 243)and(k <= 246 ) then {a |font_def|}
begin D_par:=get_integer(dvi) (k-242 ); define_font(D_par); k:=NOP;
end;
until k<>NOP;
if k=POST then
max_pages := -1
else if k<>BOP then bad_dvi('byte is not BOP')
@.Fatal: Bad DVI file@>
else begin
for k:=0 to 9 do counter[k]:= get_integer(dvi)(-4);
skip(dvi)(4);
end;
end;
@ A \.{DVI}-reading program that reads the postamble first need not look at the
preamble; but \.{Crudetype} reads the \.{DVI} file sequentially.
@d PRE=247 {preamble}
@<Read \.{DVI} preamble@>=
bbb:= get_byte(dvi); {fetch the first byte}
if bbb<>PRE then bad_dvi('First byte isn''t start of preamble!');
@.Fatal: Bad DVI file@>
bbb:= get_byte(dvi); {fetch the identification byte}
if bbb<>id_byte then
warn('identification byte should be ',id_byte:1,', it is actually', bbb:1 );
@.error: identification...@>
@<Compute the conversion factors@>;
bbb:= get_byte(dvi); {fetch the length of the introductory comment}
{AKT: removed display(' ');}
for nnn := 1 to bbb do
{AKT: was display(zchr(get_byte(dvi)));}
bbb := get_byte(dvi);
{AKT: removed display_ln(' ');}
@ The conversion factor |h_conv| is figured as follows: There are exactly
|n/d| decimicrons per \.{DVI} unit and 254000 decimicrons per inch, and
|h_resolution| |h_steps| per inch.
@<Glob...@>=
dvi_factor, h_conv, v_conv, magnification : real_num;
nnn:integer; {general purpose register}
bbb: byte ;
@ @<Compute the conversion factors@>=
dvi_factor := get_real(dvi)/254000.0 ;
magnification := get_integer(dvi)(4) / 1000 ;
dvi_factor := dvi_factor * magnification ;
{This converts \.{DVI} units to inches (on an ideal device) }
h_conv:= dvi_factor * h_resolution * h_fudge * extra_mag;
v_conv:= dvi_factor * v_resolution * v_fudge * extra_mag ;
@* Translating the device-independent file, 3: Setting a Rule.
|D_p| is the height and |D_q| is the width. A rule has to be assembled from
the available characters. First: is the rule to be set at all? Second: is it
horizontal or vertical? (Because of the limited name lengths, we call them
|Post| and |Rail|.) The test applied here is quite arbitrary.
@<Medium...@>=
procedure set_rule;
var D_p,D_q: integer;
begin
D_p:=get_integer(dvi) (-4);
D_q:=get_integer(dvi)(-4);
if (D_p<=0)or(D_q<=0) then
{an invisible rule! Dont ask me why \TeX\ wants to do this}
else if D_p*v_conv <= post_height/2
then do_rail(D_p, D_q)
else do_post(D_p, D_q);
end;
@ Setting a vertical rule is simple: we just fill all the space with the
relevant character.
@<Lowest...@>=
procedure do_post(D_rul_ht, D_rul_width: integer);
var vn, vi, hn, hi, post_v, rule_hp : integer;
rule_cod: code_object ;
begin
@<|Post| set sizes@>;
for vi := vn - 1 downto 0 do
begin
post_v := IM_v - vi * post_height ;
for hi := 1 to hn do
begin
rule_hp := IM_h + (hi - 1) * post_width ;
do_set_char(post_v, rule_hp, rule_cod);
end;
end;
end;
@ Note that whereas \.{DVItype} rounds all sizes up, \.{Crudetype} rounds to
nearest integer. This seems more likely to work on a crude resolution.
But we force the rounded size to be |>= 1| .
@<|Post| set...@>=
round_IM_h ( 0);
hn := round(D_rul_width * h_conv / post_width );
vn := round(D_rul_ht * v_conv / post_height);
if hn <= 0 then hn := 1;
if vn <= 0 then vn := 1;
rule_cod := post_char;
@ A horizontal rule is more complicated, as there is then a selection of
characters. This matters if the printer has only a very coarse vertical
positioning. For example, a line printer has only minus and underscore, but a
VT-100 has 5 bars at different heights. |@!rail_types| should be set to the
number of different horizontal bars that the printer can draw within one
|v_step|. We measure the vertical position of a rule in |rail_steps|,
which are smaller than |v_steps| in the same ratio.
@<Glob...@>=
rail_chars : packed array [1..rail_types] of code_object ;
{Number from bottom of page up; so no. 1 might be an underscore}
rail_base : integer ;
{Position of bottom edge of a rule in |rail_steps|}
post_char : code_object ;
@ @<Const...@>=
@<Rule setting constants@>
{Printer-dependent, so they must go at the end of the file}
@ @<Lowest...@>=
procedure do_rail(D_rul_ht, D_rul_width: integer);
var vn, vi, hn, hi,
rail_v, {Current position in |rail_steps|}
char_vp, {Position in |v_steps| where a rule char will be set}
rule_hp: integer;
rule_cod: code_object ;
char_i : 1..rail_types ; {indicates which character to be used}
begin
@<|Rail| set sizes@>
for vi := vn-1 downto 0 do begin
rail_v := rail_base - vi ;
@ Now to assign |char_i| and |char_vp|. The easiest way is to consider a simple
example. Suppose |rail_types = 5| and |rail_v = 50|. This addresses the
underscore at the bottom edge of a text character at |10 v_steps|.
So |char_i| wants to be 1 and |char_vp| 10. So...
@<Lowest...@>=
char_vp := ((rail_v - 1) div rail_types ) + 1 ;
char_i := rail_types - ((rail_v - 1) mod rail_types ) ;
rule_cod := rail_chars [ char_i] ;
for hi := 1 to hn do begin
rule_hp := IM_h + (hi-1) * rail_width ;
do_set_char(char_vp, rule_hp, rule_cod) ;
end;
@ @<|Rail| set...@>=
round_IM_h ( 0);
hn := round(D_rul_width * h_conv/ rail_width);
vn := round(D_rul_ht * v_conv * rail_types/ rail_height );
if hn <= 0 then hn := 1;
if vn <= 0 then vn := 1;
@ Now consider how to set |rail_base|. Horizontal rules are mostly used for
underlining text, and then they should be aligned with the underscore
character on the same line of text. So normally, we just do the following. The
exception occurs when the \.{DVI} file does an explicit vertical move.
@<Set |rail_base|@>=
rail_base := IM_v * rail_types ;
@* Translating the device-independent file, 4: Changing and defining Fonts.
The following tables describe all the \TeX\ fonts that \.{Crudetype}
currently knows about.
@<Glob...@>=
nf: D_font_ptr ;
{The number of fonts so far defined. These will be numbered |0..nf-1| }
@!font_num, {external font numbers}
@!font_space, {boundary between ``small'' and ``large'' spaces}
@!scheme, {pointer to coding scheme}
@!first_ch, {First character in the font}
@!last_ch: {and last}
array [D_font_ptr] of integer;
D_width: array[D_font_ptr, D_char_ptr ] of integer ;
{character widths, as given in \.{TFM} file, should be in \.{DVI} units}
@!D_check, {the font checksum must be global for HPGF}
thin_space, D_font, cur_scheme: integer ; {The current values}
@ @<Type...@>=
D_font_ptr = 0..max_D_fonts;
D_char_ptr = 0..max_D_char;
@ The size of the tables can be altered at compile time to extend or reduce
\.{Crudetype}'s capacity.
@<Constants...@>=
@!max_D_fonts=100; {maximum number of distinct fonts per \.{DVI} file}
@!max_D_char =255; {AKT: was 127 but we want to handle PostScript fonts}
@ Initially, all these tables are blank.
@<Set init...@>=
nf:=0;
for in_i := 0 to max_D_fonts do
begin
font_num[in_i ] := 0 ;
scheme[in_i ] := 0 ;
first_ch[in_i ] := 0 ;
last_ch[in_i ] := 0 ;
font_space[in_i]:= 0 ;
end;
@ @<Set up an empty page image@>=
D_font := nf ;
cur_scheme := 0 ;
@ @<Medium...@>=
procedure change_font (D_new: integer);
begin
D_font := 0 ;
font_num[nf]:=D_new;
while font_num[D_font]<>D_new do incr(D_font);
if D_font = nf then
warn('Undefined font called for, number ', D_new:1 );
@.error: Undefined font@>
cur_scheme := scheme[D_font] ;
thin_space := font_space[D_font] ;
end;
@ The following procedure is called whenever we read a |font_def| command from
the \.{DVI} file. In general, any error while defining a font causes a jump to
label |bad_font|, leaving the new font undefined.
@d bad_font = 9999
@d good_font = 9998
@d font_error(#) == begin
warn(#); display_ln('font number ', D_new:1, ' cannot be loaded') ;
goto bad_font ;
@<Medium...@>=
procedure define_font (D_new:integer );
label bad_font , good_font ;
var @<|font_def| vars@>
begin
@<Read the font parameters from the \.{DVI} file,
calculate scaling factors@>;
@<Try to load the new font, unless there are problems@>;
good_font:
@<Final checks; various mild errors which often are symptoms of bugs@>
incr(nf) ; {the new font is officially present}
bad_font: if font_ok then close_binary(tfm_file);
end;
@ First we read the parameters from the \.{DVI} file. Whatever errors are
found, we must try to do this, or we lose place in the file.
@<|font_def| vars@>=
scale_size, design_size, k, f : integer;
dir_len, {length of the area/directory spec}
nam_len:byte; {length of the font name proper}
font_mag: real_num;
@ @<Read the font parameters...@>=
@!D_check := get_integer(dvi)(-4) ;
scale_size:= get_integer(dvi)( -4) ;
design_size:= get_integer(dvi)(-4) ;
dir_len:= get_integer(dvi)(1) ;
nam_len:= get_integer(dvi)(1) ;
nam_len := nam_len + dir_len ;
if nam_len = 0 then
font_error('null font name! ')
@.error: null font name@>
else if nam_len >= string_length then
font_error('too-long font name! length = ', nam_len:1 ) ;
@.error: too-long font name@>
tfm_name := blank ;
for k:=1 to nam_len do begin
tfm_name.data[k] := zchr(get_byte(dvi)) ;
end;
tfm_name.len := nam_len ;
upcase(tfm_name) ;
{AKT: removed
display_ln(' ');
string_show(tfm_name);
display(' ');
@ Next, check that the sizes are reasonable:
@<Read the font parameters...@>=
if (scale_size<=0)or(scale_size>=@'1000000000) then
font_error('--- bad scale (',scale_size:1,')!')
@.error: bad scale@>
else if (design_size<=0)or(design_size>=@'1000000000) then
font_error('--- bad design size (',design_size:1,')!') ;
@.error: bad design size@>
font_mag := scale_size/design_size ;
if (font_mag > 1000) or (font_mag < 0.001) then
warn('thats a very unusual font magnification!!! ', font_mag) ;
@.error: unusual font mag...@>
if nf=max_D_fonts then
abort('Crudetype capacity exceeded (max fonts=', max_D_fonts:1,')!');
@.Fatal: Capacity exceeded... @>
font_num[nf]:=D_new; f:=0;
while font_num[f]<>D_new do incr(f);
if f<nf then font_error('---this font was already defined!');
@.error: font already defined@>
font_space[nf] := scale_size div 6 ; {a `thin space' }
@* Loading the font file.
See \.{TFTOPL} or \TeX 82 for details of the \.{TFM} file format. The
description given in \.{TUGboat} (Vol.2, no. 1) is apparently no longer
accurate. The only difference that I have seen is that all words of the font
header array after the first 2 are now apparently regarded as optional.
@.TFTOPL@> @.TeX82@> @.TUGboat@>
@<Try to load...@>=
@<Open font file@>
if not font_ok then
font_error('---TFM file can''t be opened!');
@.error: TFM file cant be opened@>
@<Read past the header data, leave the file pointer just after the header@>
@<Read the character-width indices@>
@<Read the widths, copy them into the font array@>
@ @<|font_def| vars@>=
@!TFM_check,
@!lh, {length of the header data, in four-byte words}
@!nw:integer; {number of words in the width table}
@ @<Read past the header...@>=
skip(tfm)(2); lh:= get_integer(tfm)(2);
first_ch[nf]:=get_integer(tfm)(2); last_ch[nf]:=get_integer(tfm)(2);
if (last_ch[nf]<first_ch[nf]) or (last_ch[nf] > max_D_char) then
font_error(
'Illegal values for first_char and/or last_char, first_char = ',
first_ch[nf]:1 , ' last_char = ', last_ch[nf]:1 );
@.error: Illegal value@>
nw:=get_integer(tfm)(2);
if (nw=0)or(nw>256) then
font_error('Illegal value for nw, nw= ', nw );
@.error: Illegal value@>
skip(tfm)(14);
TFM_check := get_integer(tfm)(-4);
skip(tfm)(4);
@<Get coding scheme and re-align file, then see if the printer knows it@>
@ The header contains |4*lh| bytes, of which 8 have been read so far. If it
conforms to the \.{TUGboat} format, then the next byte (|@!ck|, say) is the
number of bytes in the coding scheme name. So, first we must try to see if a
scheme is present; if so, then we will read |ck+1| bytes and chuck the rest.
If no coding scheme is present, we simply skip the rest of the header.
Internally, scheme names are represented by |var_string|s.
@<Get cod...@>=
TFM_scheme := blank ;
if lh < 2 then font_error( ' Header must have at least 2 words')
else if lh = 2 then do_nothing
else begin
ck := get_byte(tfm);
if ( ck >= 40 ) or ( ck > 4*lh - 9) then
skip(tfm)(4*lh - 9)
{there is something here, but not a coding scheme}
else begin
TFM_scheme.len := ck ;
for j := 1 to ck do
TFM_scheme.data[j] := zchr(get_byte(tfm)) ;
skip(tfm)(4*lh - ck - 9);
upcase(TFM_scheme) ;
end;
end;
@ @<|font_def| vars@>=
j , ck : byte ;
@!coding_scheme, TFM_scheme: var_string ; {coding scheme of current font}
@ Now we can start reading the character widths.
@<|font_def| vars@>=
@!in_width:array[byte] of integer; {\.{TFM} width data in \.{DVI} units}
@!wid_ptr: array[byte] of byte ; {pointers into |in_width|}
b3,b2,b1,b0: byte; {bytes from \.{TFM} file}
@!alpha,@!beta, @!z :integer;
@ @< Read the character-width indices...@>=
for k:=first_ch[nf] to last_ch[nf] do
begin wid_ptr[k] := get_byte(tfm); skip(tfm)(3);
if wid_ptr[k] > nw then font_error('impossible width ' , wid_ptr[k]);
end;
@.error: impossible width @>
@ Here is the width computation. This code is copied from \.{DVItype}. See
that program for an explanation.
@<Read the font parameters...@>=
z := scale_size ;
alpha:=16*z; beta:=16;
while z>=@'40000000 do
begin z:=z div 2; beta:=beta div 2;
end;
@ @<Read the widths...@>=
for k:=0 to nw-1 do
begin
b0 := get_byte(tfm); b1 := get_byte(tfm);
b2 := get_byte(tfm); b3 := get_byte(tfm);
in_width[k]:=
(((((b3*z)div@'400)+(b2*z))div@'400)+(b1*z))div beta;
if b0 = 255 then in_width[k]:=in_width[k]-alpha
else if b0 <> 0 then
font_error('Out-of-bounds value for b0') ;
@.error: font: Out-of-bounds |b0|@>
end ;
@ Rounding widths. This bit of \.{DVItype} is changed, because \.{Crudetype}
has to calculate rounded positions by a completely different method.
@<Read the widths...@>=
if in_width[0]<>0 then font_error('the first width should be zero ');
@.error: first width...@>
for k:= first_ch[nf] to last_ch[nf] do
D_width[nf, k] := in_width[ wid_ptr[k]] ;
@ Then there are various erroneous states that do not necessarily show that
the font is corrupt, but may indicate bugs in the program. In principle, a
character might have negative width, but I do not believe it.
@d bad_char = -32766 {Indicates an unprintable character}
@d foot == 50000000 {about a foot}
@<Final checks...@>=
for k:= first_ch[nf] to last_ch[nf] do
if (D_width[nf, k] < 0) or (D_width[nf, k] > foot) then begin
warn('Way-out width = ', D_width[nf,k]:1,
'DVI units, character number ', k:1 );
codes[ scheme[nf], k].breadth := bad_char ;
end;
if (D_check<>0)and(TFM_check<>0)and(D_check<>TFM_check) then
begin warn('check sums do not agree!');
@.error: check sums...@>
display_ln('DVI check was: ', D_check, ' TFM check was: ', TFM_check);
display(' ');
end;
{AKT: removed display_ln('---loaded at size ',scale_size:1,' DVI units');}
font_mag := (font_mag -1) * 100.0 ;
{AKT: removed
if abs(font_mag) > 1 then
begin display_ln(' ');
display_ln(' (this font is magnified ', round(font_mag):1,'%)');
end;
@.this font is magnified@>@.error: Way-out width@>
@* Coding schemes.
In this section we describe the mapping from characters in \TeX\ fonts to
characters in the printer's fonts (which are presumably much fewer). All
characters on a crude printer are the same size. We therefore need one piece
of data, not for each \TeX\ font, but for each coding scheme. The mapping is
defined in an array called |codes|. For each character |c| in a \TeX\ font
whose coding scheme has internal number |s|, |codes[s,c]| describes the
corresponding printer character. Also |known_schemes[s]| is a character
string which usually contains the coding scheme of that \TeX\ font.
|max_codes| is the number of coding schemes the program knows about. First,
define that structure:
@ @<Glob...@>=
@!known_schemes: array[code_ptr] of var_string ;
@!codes: array[code_ptr, D_char_ptr] of code_object;
no_char: code_object ;
@ @<Types...@>=
code_object = packed record
breadth: i_word ;
case boolean of
true: (IM_font: byte ; IM_char: byte );
{Printers font and character}
false: (multi: i_word) ;
end;
@! code_ptr = 0..max_codes;
{0 is a coding scheme the printer doesnt know about}
@ Initially, all these tables are blank. If |c| is a |code_object|, then
|c.breadth| will usually be its printed width in |h_steps|. |c.breadth =
bad_char| indicates that the character is unprintable. |bad_char| can be any
large negative value. Other negative values of |@!breadth| indicate other
types of peculiar characters.
@d down_loaded = -32765
@<Set init...@>=
no_char.breadth := bad_char ;
no_char.IM_font := 0 ;
no_char.IM_char := 0 ;
for in_i := 0 to max_codes do
for in_j := 0 to max_D_char do begin
codes[in_i, in_j] := no_char ;
end;
@ So when a font is read in, we try to assign the right value to its |scheme|.
If the printer is not absolutely crude, then it might have italic or bold
fonts. Then we might want a coding scheme to correspond to a single \TeX\
font. So first we look at the actual font name and see if that matches any
of the |known_schemes|. But if the printer is |fixed_width|, then all fonts
of the same face are the same size, so we drop the font size digits off the
end of the name.
@<Get cod...@>=
k := tfm_name.len ;
if fixed_width then
while (zord(tfm_name.data[k]) >= "0" ) and
(zord(tfm_name.data[k]) <= "9" ) do
decr(k) ;
coding_scheme:= tfm_name;
chop_length(coding_scheme, k) ;
j := max_codes ;
while (j > 0) and (coding_scheme.data <> known_schemes[j].data ) do decr(j);
scheme[nf] := j ;
if j = 0 then
@ If the font name is not in |known_schemes|, then we try again with the
scheme given in the \.{TFM} file. If that fails, then try if we can download
the font. If that fails, then the font is deemed to be unprintable, so we do
not load it.
@<Get cod...@>=
begin
j := max_codes ;
while (j > 0) and (TFM_scheme.data <> known_schemes[j].data ) do decr(j);
scheme[nf] := j ;
end;
if (j = 0) and can_dl_font then
@<Download a whole font@>
else if j = 0 then begin
scheme[nf] := 9 ; {AKT: handle PostScript font }
{AKT: was
display ('Scheme is: ') ;
string_show(TFM_scheme) ;
font_error(' That coding scheme is unknown' );
AKT}
end;
@.error: unknown coding scheme @>
@ This procedure sets a character. The character to be set is number |@!c_num|
in the current font. I have deleted the bit of \.{DVItype} that deals with
oriental fonts, as I dont believe that crude printers can support them.
@<Medium...@>=
procedure set_character(c_num: integer );
var cod: code_object;
d_i, d_j : integer; {Used for downloading}
begin
if cur_scheme = 0 then
else if (c_num < first_ch[D_font] ) or (c_num > last_ch[D_font] )
then begin
warn('character ',c_num:1,' invalid in font number ',
font_num[ D_font]:1 );
@.error: character invalid...@>
end
else begin
cod := codes[ cur_scheme, c_num];
if cod.breadth <> bad_char then begin
round_IM_h( c_num) ;
if cod.breadth = down_loaded then
@<Enter a download request for |cod| and adjust its |breadth|@> ;
do_set_char(IM_v, IM_h, cod ) ;
@<Do messy things to adjust the positions |D_h|, |IM_h|, etc@>;
end;
end;
end;
@ @<Forw...@>=
procedure do_set_char(Set_v, Set_h: i_word; cod: code_object ); forward;
@ @<Lowest...@>=
procedure do_set_char ;
var k_i, k_k, temp_v, temp_h: i_word ;
m_c: code_object ;
k_ptr: 1..max_ligs;
begin
if cod.breadth >= 0 then begin
@<Check the position@>
@<Add the record to the page image@>
end
else if cod.breadth = bad_char then do_nothing
else @<Set multi-character command@> ;
end;
@* Multiple-character commands.
Several crude printers (e.g. daisy-wheels) have only a limited set of
characters, which cannot be extended. Sometimes you can generate more
characters by overstriking. \.{Crudetype} can be programmed to do this, by
placing suitable entries into a table called |ligatures|. The name is chosen
by analogy with the |lig_kern| programs in \.{TFM} files, but the data is
completely different. When one \TeX\ character maps onto several printer
characters, we call the image a `multi-character' command.
@<Const...@>=
max_ligs = 10000 ;
@ @<Glob...@>=
ligatures : array[1..max_ligs] of lig_thing;
top_of_ligs: 0..max_ligs ; {highest used point in |ligatures|}
@ @<Types...@>=
trio = 1..3 ;
lig_thing = packed record
case trio of
1: (v_move: i_word ;
h_move: i_word) ;
2: (code: code_object) ;
3: (num : i_word ;
guard : i_word) ;
end;
@ @<Set init...@>=
top_of_ligs := 0;
for in_i := 1 to max_ligs do ligatures[ in_i].code := no_char ;
@ The |code_object| addresses a multiple character when its |breadth| is
negative, and not one of the special classes defined above. It must then be
the |false| variant, and its |multi| field (which must be |>0|) points to the
corresponding entry in |ligatures|. Suppose that field is |c| . Then
|ligatures[c]| is the last entry of a string of items that defines the
replacement text of the |code|. It should be of the third variant; The |num|
field of this entry counts the number of characters that |code| expands into.
The |guard| field is an arbitrary impossible value called |sentry| to give a
check on the data in |ligatures| .
@d sentry = -32767
@<Set multi...@>=
begin
if (cod.multi <= 0) or (cod.multi > top_of_ligs) then
warn('Illegal value of char in multi-character command')
@.error: Illegal value@>
else begin
k_ptr := cod.multi ;
if ligatures[k_ptr].guard <> sentry then
warn('Sentry not found in Kerns ' ) ;
@.error: Sentry ...@>
k_i := ligatures[k_ptr].num ;
k_ptr := k_ptr - 2*k_i ;
if (k_i <= 0) or (k_ptr < 0 ) then
warn('Illegal value of k_i in multi_character command');
@.error: Illegal value@>
for k_k := 1 to k_i do
@<Get that character and write it @>;
end;
end
@ Each character of a multi-character command needs 2 entries in |ligatures|.
The first defines the position, the second defines the character. |v_move| and
|h_move| are relative to the current (rounded) position |Set_v, Set_h| and use
the same units. A multi-character command can call another one recursively.
@<Get that character ...@>=
begin
temp_v := Set_v + ligatures[k_ptr].v_move ;
temp_h := Set_h + ligatures[k_ptr].h_move ;
incr(k_ptr);
m_c := ligatures[k_ptr].code ;
do_set_char(temp_v, temp_h, m_c ) ;
incr(k_ptr);
end;
@* Getting data into the |codes| array.
This is clearly a very long and error-prone job, so the next procedures are put
in to reduce this. First suppose that: in the \TeX\ coding scheme with number
|s|, a run of |length| characters starting from |start| maps onto a run of
consecutive characters in printer font |PR_font|, starting at |PR_first|. This
procedure will enter the whole run at one go.
@<Lowest...@>=
procedure alphabet
(start, length: byte; s: code_ptr ; PR_font, PR_first : byte );
var i:integer; ccc:code_object;
begin @<Check alphabet data@>;
ccc.IM_font := PR_font ;
ccc.breadth := char_width ;
for i := 0 to length-1 do begin
ccc.IM_char := PR_first +i;
codes[s, start+i] := ccc ;
end; end;
@ @<Check alph...@>=
if (s < 1) then abort('alphabet: scheme < 1 ')
else if (s > max_codes) then abort('alphabet: scheme too large')
else if (PR_first < 0) then abort('alphabet: negative first')
else if (start < 0) then abort('alphabet: negative start')
else if (length < 0) then abort('alphabet: negative length')
else if (start + length -1 > max_D_char) then abort('alphabet: overflow')
@.Fatal: alphabet...@>
@ Clearly, |alphabet| will only cover a very small part of the problem. The
next procedure enters data into a subset of the |codes| array corresponding to
a single row of a \TeX\ font. In the standard font tables, row number |m| is
the subrange |8*m..8*m+7| of a font. It is hoped that when the calls of
procedure |row| are written out in a program, the result will be (just about)
legible, whereas a flood of statements like \begintt
codes[i,j].IM_font := 121; \endtt
is certainly not legible.
The parameters are as follows. |@!row_spec| specifies what characters are to
go into the row. |@!scheme | is the number assigned to the \TeX\ coding scheme
within the program. |@!row_num | is the number of the row in that scheme
(starting from 0). |@!first_font| is the initial printer font.
@<Top...@>=
procedure row
(row_spec: row_str; scheme, row_num: integer; first_font: i_word );
var n :integer; codd: code_object;
begin
incr(row_count);
row_pt := 1 ;
row_font := first_font ;
row_string := row_spec;
for n := 8*row_num to 8*row_num + 7 do begin
row_char (0 ,codd);
if codd.breadth = bad_char then do_nothing
else codes[ scheme, n ] := codd ;
end;
end;
@ \.{TANGLE} imposes a limit of 69 on the length of quoted strings. This is a
considerable nuisance, as we could make the |row_spec| strings look much better
if they could be longer.
@<Const...@>=
row_length = 69 ;
@ @<Types...@>=
row_str = packed array [1..row_length] of char ;
@ @<Glob...@>=
row_pt: integer; {Points to next char from |row_spec|}
row_font: i_word; {printer font being addressed during the |row| procedure}
row_string: row_str ;
row_count: integer ;
@ @<Set init...@>=
row_count:= 0 ;
@ In order to help debugging, error messages will print |row_string| and a
pointer. The diagnostics of |row| are known to be very poor; I have not
bothered to fix them because up to now they have been adequate, and they are
really meant for the installer rather than the end user.
@d row_warn(#) == begin
display_ln(row_string ) ;
display_ln('^' : row_pt-1 ) ;
warn('Row: ', #);
return;
@ The overall format of the |row_spec| is a set of 8 character specifiers
separated by one or more spaces. The procedure |row_char| reads one character
specifier from the |row_string|, and constructs the specified |code_object|.
Logically, |row_char| should be a function and return that |code_object| as
its value. \PASCAL\ does not permit this. So we assemble the result in the
variable parameter |value|.
@<Medium...@>=
procedure row_char(context: integer; var value: code_object);
label exit ;
const @<|Row_char| constants@>
var c :byte; @<Row locals@>
begin
value.breadth := char_width ; {default}
value.IM_font := row_font ; {default font}
c := row_get ;
if ( context = 0) and ( c <> " " ) and ( c <> "Z") then
row_warn('Character specifiers must start with at least one space') ;
while ( c = " " ) do c := row_get ;
@<Escape sequences in the |row_spec| @>
else value.IM_char := c ;
exit: end;
@.error: Row: Character spec...@>
@ There are several escape sequences that need to go into the |rowstring|.
Since all the PLAIN.TEX coding schemes (except the math extension one) have
the upper case Roman characters in their ASCII positions, these characters
will surely be inserted into |codes| by the |alphabet| procedure. So they are
available as flag characters. But the brackets are also used as flags, as they
are so much more perspicuous than anything else. Here is a list of the
characters currently used as escapes: \begintt
A C D E F K L M N S Q U W Z \endtt
This list should be updated if other escapes are added .
@.Escape sequences@>@.ASCII@>
@ Some characters, called `bad', have most undesirable effects when used in
\.{WEB} strings. So the following upper-case letters stand for them. The
actual characters may not be used, so they generate errors.
@<Esc...@>=
if c = "A" then value.IM_char := 64 {at sign}
else if c = "S" then value.IM_char := 32 {a space}
else if c = "Q" then value.IM_char := 39 { a single quote char}
else if c = "W" then value.IM_char := 34 { a double quote char }
else if c = "E" then value.IM_char := 127 { a delete char }
else if (c = "'") or (c = """") or (c = "@@") or (c = 127)
then row_warn( 'Bad character---Rejected' )
else if (c = " ")
then row_warn('space found out of context')
@.Error: Row: Bad character@>@.error: Row: space found...@>
@ Then the `Z' escape is provided to generate a do-nothing code. This would
be used if a previous call (say, of |alphabet|) had left a row partly
incorrect. Then you might issue a call of |row| to change that row. Typing `Z'
at the positions occupied by correct characters would leave them alone.
@<Esc...@>=
else if c = "Z" then value.breadth := bad_char
@ Since many letters and brackets are used as escapes, the `L' escape is
needed to enable them to be used Literally. `LL' generates `L'.
@<Esc...@>=
else if c = "L" then value.IM_char := row_get
@ In order to address printer characters in the range 0..32, where ASCII has
no graphics, here is a Control escape. This simply reads the next character
from the |row_spec| and reduces it modulo 32. It is best to use the lower case
alphabet (the range 95..126) as this avoids all the `bad' characters (and
their escapes). So control-A should be typed `Ca' , not `CA' .
Then the Meta escape addresses meta-characters, i.e. those in the range
128..255. We cannot just read a character and add 128, as we might want to
Mutate the ASCII controls, or the `bad' characters. So `M' must read a complete
|code_object| (respecting the escapes given above) and add 128 to its |IM_char|
field. So we must say `MS' for `meta-space' = 160 , and `MLS' for `meta-S'
= 211 .
@.ASCII@>
@d M_con == 250
{Context while reading a Meta character}
@<Esc...@>=
else if c = "C" then
value.IM_char := row_get mod 32
else if context >= M_con then value.IM_char := c
{During a Meta, forbid any of the later escapes}
else if c = "M" then
begin
row_char(M_con , value ) ;
value.IM_char := value.IM_char + 128 ;
end
@ A |narrow| character is one with zero width. To generate one, precede it
with an `N' . To mark a character as down-loadable, precede it with `D'. A
character cannot be both narrow and down-loadable.
@d N_con == 230
{Context while reading a Narrow or |down_loaded| character}
@<Esc...@>=
else if ((c = "N" ) or (c = "D" )) and (context >= N_con) then
row_warn('Narrow or Down escape out of context')
@.error: Row: Narrow escape...@>
else if c = "N" then
begin
row_char(N_con, value ) ;
value.breadth := 0 ;
end
else if c = "D" then
begin
row_char(N_con, value ) ;
value.breadth := down_loaded ;
end
@ Changing printer fonts in the middle of a |row| is done by inserting an `F'
character, followed by an integer. This is the printer font to be used, from
now on till the next `F' . Note that the initial font was passed as the 3rd
parameter to |row|.
@<Esc...@>=
else if c = "F" then begin
row_font := row_integer ;
if context = 0 then row_char(1, value)
else row_char(context, value);
end
@* Assembling a multi-character in |row|.
Now we come to the difficult part, which is assembling a multiple-character
command into the |ligatures| array. For this purpose, we use brackets. Curly
brackets mean that the characters inside are to be overstruck, square brackets
mean they are to be typed horizontally, and angle brackets mean that they are
to be typed vertically above each other. Finally the `U' escape (which must
come immediately after a |<| ) means to raise the (logical) cursor before
starting the vertical list.
Warning!! I use the numerical (\.{ASCII}) values of these chars
@.System dependencies@>@.ASCII@>
@<|Row_char| const...@>=
o_bra = "{" ; o_ket = "}" ;
h_bra = "[" ; h_ket = "]" ;
v_bra = "<" ; v_ket = ">" ;
{`o' means overstrike, `h' means horizontal, and `v' vertical}
@ So if we want to generate a Macsyma style summation sign, which looks like
this: \begintt
. ====
. \
. >
. /
. ====
\endtt
we have to insert the following mess into the |row_spec| string: \begintt
<S[====]\[SL>]/[====]>
\endtt
The `S' is needed to get correct vertical alignment. The `L' is needed to
prevent the following |>| being taken as a |ket|. See the lineprinter change
file for examples.
@ In order to keep some control over all these escape sequences, I have made a
special rule of syntax. The escape sequences in |row_char| may only be nested
in a definite order. That order is: (bad characters or Control or Literal)
inside Meta inside (Narrow or Down-loadable) inside Font inside over-lists
inside |h_list|s inside |v_list|s. The parameter |context| keeps track of
this. The innermost constructions have the highest values of |context|. If
these rules are broken the user should get an error message saying `Row' and
some diagnostics.
@<Esc...@>=
else if (c = o_bra) or (c = h_bra) or (c = v_bra)
then begin
if context >= c then
row_warn('Illegal nesting of brackets in row_spec');
@.error: Row: Illegal nesting @>
@<Assemble characters into |lig_buff| until we read the matching |ket|@>;
@<Copy |lig_buff| into |ligatures| and return a pointer to it@>;
end
@ |hoister| and |ender| are arbitrarily selected impossible classes for a
character, indicating respectively that a |v_list| has to be raised one
|char_ht| or that a |ket| has been read.
@d hoister = -32764
@d ender = -32763
@<Assemble...@>=
for i := 1 to max_buf do lig_buff[ i].code := no_char ;
buf_len := 0; delta_h := 0; delta_v := 0;
repeat
row_char(c ,row_cod ) ;
@<Do suitable action if |row_cod| is peculiar@>
else begin
incr(buf_len);
lig_buff[buf_len].v_move := delta_v ;
lig_buff[buf_len].h_move := delta_h ;
incr(buf_len);
lig_buff[buf_len].code := row_cod ;
if c = v_bra then delta_v := delta_v + char_ht;
if c = h_bra then delta_h := delta_h + char_width ;
end;
until row_cod.breadth = ender;
@ @<Row loc...@>=
lig_buff: array[1..max_buf] of lig_thing ;
buf_num: 0..max_buf ;
{Number of characters (or multi-characters) in current list}
buf_len: 0..max_buf ;
{Number of used locations in |lig_buff|: should be |2*buf_num|}
delta_h, delta_v: i_word;
i : integer;
row_cod: code_object ;
@ @<Const...@>=
max_buf = 201;
@ @<Do suitable action...@>=
if row_cod.breadth = hoister then delta_v := delta_v - char_ht
else if row_cod.breadth = ender then
else if buf_len + 3 > max_buf then
abort('overflowed lig_buff array')
@.Fatal: overflowed |lig_buff|@>
@ @<Esc...@>=
else if (c = "U" ) and (context = v_bra) then value.breadth := hoister
else if (c = "U" ) then
row_warn('U escape out of context')
@.error: Row: U escape...@>
else if ((c = o_ket) or (c = h_ket) or (c = v_ket)) and (context = c-2)
then value.breadth := ender
else if (c = o_ket) or (c = h_ket) or (c = v_ket)
then row_warn('mismatching brackets ')
@.error: Row: mismatching brackets@>
@ Yet another escape is the |kern| escape. If the printer has reasonable
positioning resolution, we may want to move the parts of a multi-character
about to make them fit together better. So a |kern| takes an integer parameter
and moves the next component of the current list by that many |steps| against
the current direction. The reason for going back is that one can easily move
forwards by setting a blank space.
@d h_kern = -32762
@d v_kern = -32761
@<Esc...@>=
else if (c = "K") and (context = h_bra) then value.breadth := h_kern
else if (c = "K") and (context = v_bra) then value.breadth := v_kern
else if (c = "K")
then row_warn('Kern escape out of context' )
@.error: Row: Kern escape @>
@ @<Do suitable action...@>=
else if row_cod.breadth = h_kern then delta_h := delta_h - row_integer
else if row_cod.breadth = v_kern then delta_v := delta_v - row_integer
@ @<Copy...@>=
buf_num := 0 ;
if buf_len = 0 then value.breadth := bad_char
else if top_of_ligs + buf_len + 1 >= max_ligs then
abort ('ligature array overflowed, must recompile with larger array')
@.Fatal: ligature overflowed@>
else begin
for i := 1 to buf_len do
ligatures[ top_of_ligs + i ] := lig_buff[i] ;
top_of_ligs := top_of_ligs + buf_len + 1 ;
buf_num := buf_len div 2 ;
ligatures[top_of_ligs].num:= buf_num ;
ligatures[top_of_ligs].guard := sentry ;
value.multi := top_of_ligs ;
value.breadth := -20000 ;
{Provisional: a nonsense value to make sure the correct value does get
inserted later}
end;
@ Finally, here are the two functions that actually read the |row_spec|. The
first is |row_integer|. This reads an integer parameter for the |font| and
|kern| escapes. The parameter may have a minus sign and is terminated by the
next non-digit. (If there needs to be another digit immediately after the
parameter, then prefix it with an `L').
@<Lowest...@>=
function row_integer: integer;
label exit ;
var neg: boolean ; n: integer ; b: byte ;
begin
n := 0 ; neg := false ; b := row_get ;
if (b = "-") then begin
b := row_get ; neg := true;
end
else if ( b="+") then b := row_get ;
if (b < "0" ) or (b > "9" ) then
row_warn( 'no digits found by row_integer')
@.error: Row: no digits @>
else
repeat
n := n*10 + b - "0" ;
b := row_get ;
until (b < "0" ) or (b > "9" ) ;
if neg then n := -n ;
exit: row_integer := n ; decr( row_pt) ;
end;
@ And this function gets the next character from the |row_spec|. I always have
great difficulty with this sort of program, so will go carefully. Recall:
|row_pt| points to the next character we are going to read. |b| is that
character, translated into a byte by |zord|.
@<Forw...@>=
function row_get : byte; forward;
@ @<Lowest...@>=
function row_get;
label exit ;
var b: byte ;
begin
b:= 127 ; {any bad character}
if row_pt > row_length then
row_warn('fallen off end of row_string')
@.error: Row: fallen off end@>
else begin
b:= zord(row_string[row_pt]) ;
incr(row_pt);
end;
exit: row_get := b ;
end;
@* Character strings.
In this section I have tried to provide some tolerable string-handling
facilities in despite of the restrictions of \PASCAL. This does not seem to
belong in any particular place in the program, but in view of the horrible
gruesome things that will happen in the next section, it seemed a good idea to
give some light relief. That is why this section is inserted here.
The |var_string| type is principally used for file names and to send command
sequences to the printer. Logically, these procedures should all be functions
and return the results, but stupid \PASCAL\ does not allow this. It would of
course be much cleaner to use the VMS |varying| type, but that would make the
program non-portable.
@<Const...@>= string_length = 100 ; {a guess, of course}
@ @<Types...@>=
s_ptr = 1..string_length ;
s_dat = packed array[ s_ptr] of char ;
var_string= packed record
len: byte;
data: s_dat ;
end ;
@ |@!blank| is used for initialising strings. It should not be altered
anywhere but here.
@<Set init...@>=
blank.len := 0 ;
for in_i := 1 to string_length do
blank.data[in_i] := ' ' ;
@ @<Glob...@>=
blank: var_string ;
@ Here are two small procedures for printing strings.
@<Lowest...@>=
procedure string_show(ss: var_string);
var s_n: byte ;
begin
for s_n := 1 to ss.len do display(ss.data[s_n]) ;
end;
procedure string_print(ss: var_string);
var s_n: byte ;
begin
for s_n := 1 to ss.len do print(ss.data[s_n]) ;
end;
@ @<Forw...@>=
procedure upcase(var s: var_string) ; forward;
{convert to upper case}
@ @<Lowest...@>=
procedure upcase ;
var i: s_ptr; k: byte ;
begin
for i := 1 to s.len do begin
k:= zord(s.data[i]) ;
if (k >= "a" ) and (k <= "z" ) then
s.data[i] := zchr(k + "A" - "a" ) ;
end;
end;
@ We also use |var_strings| for command strings to be sent to the printer.
These nearly always use unprintable ASCII characters, typically ESCAPE. So we
need a special procedure to initialise them. It copies the |in_data| into the
|result|, but changes each |flag| into the character |zchr(escape)|. We
determine the length by assuming that the |in_data| string is padded with some
character; then we run backwards along it until we hit the latest non-pad
character. Since \.{TANGLE} imposes a limit of 69 on the length of quoted
strings, we use the |row_str| type defined elsewhere.
@.ASCII@>
@<Lowest...@>=
procedure set_string
(var result: var_string; in_data: row_str;
flag: char; escape: byte );
var i: byte ; last:char ;
begin
result := blank ;
i := row_length ;
last := in_data[i] ;
while (in_data[i] = last) and (i > 1 ) do decr(i) ;
if (i=1) and (in_data[1] = last ) then i := 0 ;
{YEUCH! but if I write this in a natural way, it crashes when |i=0|}
result.len := i ;
for i := 1 to result.len do
if in_data[i] = flag then result.data[i] := zchr(escape)
else result.data[i] := in_data[i] ;
end;
procedure add_char(var s: var_string; c: char) ;
begin
if s.len >= string_length then warn('string too long')
else begin
incr(s.len) ;
s.data[s.len] := c ;
end;
end;
@.error: string too long@>
procedure append(var head: var_string; tail: var_string) ;
var k: integer;
begin
if head.len + tail.len > string_length
then warn('string too long')
else begin
for k := 1 to tail.len do
head.data[ k + head.len] := tail.data[ k] ;
head.len := head.len + tail.len ;
end;
end;
@.error: string too long@>
@ The next procedures generate substrings. If the character |c| is present in
|s|, then |chop_top| deletes the first |c| from |s|, and all successive
characters. |chop_tail| deletes the last |c| and all previous characters.
|chop_length| chops the string to the stated length.
@<Lowest...@>=
procedure chop_top(var s: var_string; c:char );
var t: var_string; i,n: byte ;
begin
n := 1 ;
while (n <= s.len ) and (s.data[n] <> c) do incr(n);
if n <= s.len then begin
t := blank ;
for i := 1 to n-1 do t.data[i] := s.data[i ];
t.len := n-1 ;
s := t ;
end;
end;
procedure chop_tail(var s: var_string; c:char );
var t: var_string; i,n: byte ;
begin
n := s.len ;
while (n >= 1 ) and (s.data[n] <> c) do decr(n);
if n >= 1 then begin
t := blank ;
for i := 1 to s.len - n do t.data[i] := s.data[n+i];
t.len := s.len - n;
s := t ;
end;
end;
procedure chop_length(var s: var_string; k: integer);
var n: integer;
begin
if (k < 0 ) or (k > s.len) then
warn ('Impossible length supplied to chop_length' )
else begin
for n:= k+1 to s.len do s.data[n] := ' ' ;
s.len := k ;
end;
end;
@.error: impossible length@>
@ Printer commands usually have the format (prefix)(parameter)(suffix). These
procedures print the parameter. |s| is one character, and may have the values
`B'(yte), `D'(ecimal), `H'(exadecimal), `O'(ctal), or `W' (a 16-bit signed
word, in twos-complement notation).
@<Lowest...@>=
procedure string_base(var result:var_string; n:integer; s:byte) ;
{ |n| to base |s| . Note that the integer is appended to |result|}
var nh : integer ;
begin
nh := n ;
if nh < 0 then begin add_char(result, '-'); nh := - nh ; end ;
if nh >= s then begin
string_base(result, nh div s, s) ;
nh := nh mod s ;
end ;
if nh >= 10 then add_char(result, zchr(nh - 10 + "A" ))
else add_char( result, zchr(nh + "0" )) ;
end;
procedure string_integer (var ss: var_string; n:integer; c:char);
var nn: integer ;
begin if c = 'O' then string_base(ss, n, 8)
else if c = 'H' then string_base(ss, n, 16)
else if c = 'D' then string_base(ss, n, 10 )
else if (c = 'B') and (n >= 0) and (n <= 255)
then add_char(ss, zchr(n))
else if c = 'B' then warn('out-of-range byte')
else if c='W' then begin
if (n>= 0) and (n <= 32767) then begin
add_char(ss, zchr(n div 256));
add_char(ss, zchr(n mod 256));
end
else if (n<0 ) and (n> -32768) then begin
nn := n + 65536 ;
add_char(ss, zchr(nn div 256));
add_char(ss, zchr(nn mod 256));
end
else warn('out-of-range word') ;
end
@<Hook for weird parameter types@>
else warn('string_integer called with illegal type') ;
end;
@.error: out-of-range...@> @.error: string_integer called...@>
procedure print_integer (n:integer; c:char);
var ss: var_string;
begin ss := blank ;
string_integer(ss, n, c) ;
string_print(ss) ;
end;
@ @<Hook for weird ...@>=
@* Translating the device-independent file, 5: Movements.
This section considers the problem of deciding where each character has to be
printed on the printer's page. This is by far and away the most difficult (and
unsatisfactory) part of \.{Crudetype}. The current version is not a properly
designed algorithm; it is merely a bodge, obtained by a lot of trial and
error. It does seem to give tolerable results on \.{WEB} files, lineprinter,
and VMS. The main variables are: |@!D_h| is `\TeX's cursor'. It gives the
`exact' horizontal position (in \.{DVI} units) generated by \.{DVI} commands.
This is always updated exactly as in \.{DVItype}. |@!IM_h| is the `page
image's cursor'. It marks the position (in |h_steps|) where the next character
will be set.
The procedure |round_IM_h| is called immediately before we set a character or
a rule. We have to take account of all the movements that occurred since the
last previous character was set.
@<Forw...@>= procedure round_IM_h( code: byte); forward ;
@ @<Lowest...@>=
procedure round_IM_h ;
var
s_top, diff, n, m,
delta, new_IM_h, rounded_h : integer ;
begin
@<Find the new position |new_IM_h|@>
IM_h := new_IM_h ;
end;
@ The obvious method is to multiply |D_h| by a factor |h_conv| and round to
nearest integer. This gives extremely bad results, because the characters in
\TeX\ fonts vary enormously in width, while many crude printers have
|fixed_width| characters. If |h_conv| is too large, then you get spaces in the
middle of words. If |h_conv| is too small, then successive characters in a
word get printed on top of each other. With an intermediate value of |h_conv|,
you get both effects at once; in other words, the characters in \TeX\ fonts
vary so much in width that the `too large' and `too small' values of |h_conv|
overlap. In this situation, a great deal of jiggery-pokery is needed to get a
tolerable result (sometimes! I have not been able to make this code work in
general.)
For a start, here is the algorithm used in \.{DVItype}. |D_h_right| and
|IM_h_right| give the latest value of |D_h| and |IM_h| after the latest
previous character or rule was set. If the horizontal motion is small, like a
kern, |IM_h| changes by rounding the kern; but when the motion is large,
|IM_h| changes by rounding the true position |D_h| so that accumulated
rounding errors disappear. Also, we insist that the total amount of drift is
bounded.
@d h_step_round(#) == round(h_conv * # )
@d max_drift == 2
@<Find the new position |new_IM_h|@>=
rounded_h := h_step_round(D_h) + l_margin ;
delta := D_h - D_h_right ;
if (delta > thin_space) or (delta <= -4*thin_space) then
new_IM_h := rounded_h
else new_IM_h := IM_h_right + h_step_round(delta);
if not fixed_width then begin
if new_IM_h > rounded_h + max_drift
then new_IM_h := rounded_h + max_drift
else if new_IM_h < rounded_h - max_drift
then new_IM_h := rounded_h - max_drift ;
end else
@ Calculating |IM_h| on a |fixed_width| printer is very hairy. If we are not
careful, then the spaces between words will sometimes get rounded to 0. Since
we round `large' movements by rounding |D_h|, the space may even get rounded to
a negative value, if there was previously a lot of drift. So we must re-round
|new_IM_h|. The next idea is that whenever \TeX\ moves right by an amount that
seems large enough to be a space between words, we force |IM_h| to increase.
@<Find the new position |new_IM_h|@>=
if (delta > thin_space) and (new_IM_h < IM_h_right + gap_width)
@<Except in some special cases@>
then new_IM_h := IM_h_right + gap_width
else if (delta > thin_space) then do_nothing
else if (delta > - 2*thin_space)
then new_IM_h := IM_h_right
else
@ Here are two little fudges which improve the result. First, when \TeX\ puts
out a thin space, it sometimes is a bit too small to be recognised as such. So
we reduce the |font_space| when a font is defined.
@<Read the font parameters...@>=
font_space[nf] := round(font_space[nf] * 0.99 ) ;
@ The next fudge is needed to handle tables of contents. \TeX\ prints these by
putting out long streams of dots with small spaces in between. If these spaces
all get expanded to a whole character width, the right hand columns get thrown
right off the paper. So dont expand if the next character is a stop or comma.
@<Except in some special cases@>=
and not ( ( ( code = ".") or ( code = ",") ) and
( ( cur_scheme > 0) and ( cur_scheme <= max_plain )))
@ When these alternatives fail, we have lost contact between |D_h| and
|D_h_right|. This happens when \TeX\ makes a large backspace; in fact \TeX\
seems nearly always to do large backspaces by |pop| rather than an explicit
move left. \TeX\ often expresses boxes by a sequence like this:
\centerline{\tt{
PUS\markarrow{H} Move right ------------>
\markarrow{[}set characters] \markarrow{P}OP }}
followed by zero or more |push|es, then by a move either to one of the
positions marked by the arrows, or close by. I try to deal with this by
dropping markers at each of the arrowed positions. The markers are labelled
|D_h_right|, etc, and each marker has a corresponding value of |IM_h|
attached.
@<Glob...@>=
D_h_left, IM_h_left, D_h_mid, IM_h_mid, D_h_right, IM_h_right, {the markers}
IM_h, IM_v, D_dis, IM_dis: integer;
IM_h_stack, IM_v_stack:
array [0..max_stack+2] of integer; {pushed down values }
@ Suppose that we are about to set a character, and |D_h-D_h_right| is large
and negative. Then we compare the current value of |D_h| with all the markers.
Let |m| be the closest of these, and |mm| the corresponding rounded value.
Then we re-round |new_IM_h| to force it to lie on the `correct' side of |mm|.
This seems to work fairly often, but it does sometimes slip. First put the
markers on top of the stack...
@<Find the new position |new_IM_h|@>=
begin s_top := stack_ht ;
D_h_stack[s_top] := D_h_left;
IM_h_stack[s_top] := IM_h_left;
incr(s_top) ;
D_h_stack[s_top] := D_h_mid;
IM_h_stack[s_top] := IM_h_mid;
incr(s_top) ;
D_h_stack[s_top] := D_h_right;
IM_h_stack[s_top] := IM_h_right;
@ ...then look for the stacked value closest to |D_h|...
@<Find the new position |new_IM_h|@>=
m := s_top ;
for n := s_top downto 1 do begin
diff := D_h - D_h_stack[n] ;
if abs(diff) <= abs(delta) then
begin m := n ; delta := diff; end ;
end;
@ ...then adjust |new_IM_h| by reference to this point on the stack.
@<Find the new position |new_IM_h|@>=
if (delta > thin_space ) and ( new_IM_h < IM_h_stack[m] + gap_width) then
new_IM_h := IM_h_stack[m] + gap_width
else if (delta < -thin_space )
and ( new_IM_h > IM_h_stack[m] - gap_width)
then new_IM_h := IM_h_stack[m] - gap_width
else if abs(delta) <= thin_space then new_IM_h := IM_h_stack[m];
@ We must assign values to these markers. When we start a page, all the
markers that were left over from the previous page are irrelevant. So we reset
them. This is a good place to consider margins. The standard arrangement given
in the \TeX book (Chapter 23) is that \.{DVI} point $(0,0)$ is about an inch
in from the top and left edges of the paper. But a negative {\tt \BS hoffset}
allows \.{DVI} to address points with negative coordinates, which should still
be on the paper. It seems that the least messy way to implement this is by
adding |l_margin| to |IM_h|, whenever this is set to an absolute value.
@.TeXbook@>@.Margins@>
@<Set up an empty page image@>=
IM_h := @!l_margin ;
IM_v := @!top_margin ;
D_h_left := 0 ; IM_h_left := l_margin ;
D_h_mid := 0 ; IM_h_mid := l_margin ;
D_h_right := 0 ; IM_h_right := l_margin ;
@ So now we consider the three arrows in turn. The left hand arrow will be
marked by |@!D_h_left|. It records the latest horizontal position to be
|push|ed. There might have been a |pop| since then, so it is not necessarily
the value at the top of the stack. If we just record |IM_h| whenever we
|push|, that would give a wrong value whenever there was a sequence
|push..move_right..push|. So we must rectify the pushed value of |IM_h|.
@ @<Some adjustments...@>=
IM_h_stack[stack_ht]:=IM_h;
IM_v_stack[stack_ht]:=IM_v;
if just_pushed and (stack_ht > 0) then begin
x := h_conv*(D_h_stack[stack_ht] - D_h_stack[stack_ht - 1] );
if abs(x) > 1.5 {a guess!} then
IM_h_stack[stack_ht] := IM_h_stack[stack_ht] + round(x) ;
end;
D_h_left := D_h ;
IM_h_left := IM_h_stack[stack_ht] ;
@ The centre arrow will be marked by |@!D_h_mid|. This is defined as the value
of |D_h| just before setting the first character after the latest |push|.
@<Find the new position |new_IM_h|@>=
if just_pushed then begin
D_h_mid := D_h ;
IM_h_mid := new_IM_h;
just_pushed := false;
end;
@ The right hand arrow is marked by |@!D_h_right|. At any time, this is
defined as the right hand edge of the latest previous character (or rule) that
has just been set. This equals |D_h + D_dis|, where |D_dis| is the \TeX\ width
of the character. Usually there will follow a |move_right| that updates |D_h|,
but |D_h_right| must be updated even if there is no |move_right|. Now
|@!IM_h_right| must be aligned with the right hand edge of the printed
representation of the character. The idea is that this will usually be the
exact place where the next character has to be set. We hope that all the
characters in each word will be correctly placed next to one another and the
accumulated drift will appear in spaces between the words. So whenever a
character is set, we must assign values to |D_dis| and |IM_dis|. The character
is described by |cod|, and its printed width is written into its |breadth|
field; but if it is a multiple character, then the |breadth| is the negative
of the width.
@<Do messy things...@>=
D_dis := D_width[D_font, c_num] ;
if cod.breadth = bad_char then IM_dis := 0
else IM_dis := abs(cod.breadth) ;
@<Set |rail_base|@>
D_h_right := D_h + D_dis ;
IM_h_right := IM_h + IM_dis ;
@ So the procedure |row| must give the |breadth| field the right value when
assembling a |multi| character. Recall that that character can be either an
|o_list| or an |h_list| or a |v_list|, and |c| tells us which it is. An
|o_list| is assumed to have a width of one |char_width| and the width of a
|v_list| is the width of its widest component. The width of a |h_list| gets
accumulated in |delta_h| as the list is assembled.
@<Copy |lig_buff|...@>=
if c = o_bra then print_width := char_width
else if c = h_bra then print_width := delta_h
else begin
print_width := char_width ;
for i := 1 to buf_num do
with lig_buff[2*i].code do
if (print_width < -breadth ) and (breadth > -30000 )
then print_width := -breadth ;
end;
value.breadth := - print_width ;
@ @<Row locals...@>=
print_width: integer ;
@ We must do the same thing when setting a rule.
@<|Post| set...@>=
D_dis := D_rul_width ;
IM_dis := hn * post_width ;
D_h_right := D_h + D_dis ;
IM_h_right := IM_h + IM_dis ;
@ @<|Rail| set...@>=
D_dis := D_rul_width ;
IM_dis := hn * rail_width ;
D_h_right := D_h + D_dis ;
IM_h_right := IM_h + IM_dis ;
@ \.{DVItype} handles vertical motion in the same sort of way as horizontal.
@d v_step_round(#) == round(v_conv * # )
@<Medium...@>=
procedure move_down(ddd: integer);
var new_IM_v , delta : integer;
begin
D_v:=D_v+ddd;
delta := v_step_round(ddd) ;
@<Find a vertical position |new_IM_v|@>
end;
@ @<Find a vert...@>=
if delta >= big_drop then begin
new_IM_v := v_step_round(D_v) + top_margin ;
if new_IM_v < IM_v + big_drop then
IM_v := IM_v + big_drop
else IM_v := new_IM_v ;
rail_base := IM_v * rail_types ;
end
else if delta <= -big_drop then begin
new_IM_v := v_step_round(D_v) + top_margin ;
if new_IM_v > IM_v - big_drop then
IM_v := IM_v - big_drop
else IM_v := new_IM_v ;
rail_base := IM_v * rail_types ;
end else
@ The above calculation fails for small motions. Because \TeX\ expects
subscripts to be about half the size of the main line, it drops them by only a
small amount; with a crude printer, this small amount gets rounded to zero. If
the move is smaller than |@!tiny_drop| \.{DVI} units, we ignore it. If not,
then we force the new value of |IM_v| to be different from the old.
@<Find a vert...@>=
begin
IM_v := IM_v + delta ;
rail_base := rail_base + v_step_round(ddd * rail_types) ;
if (ddd > tiny_drop) and ( delta = 0) then IM_v := IM_v + 1
else if (ddd < -tiny_drop) and ( delta = 0) then IM_v := IM_v - 1
else rail_base := IM_v * rail_types ;
end;
@ The next bit is put in to help catch bugs. Sometimes the \.{DVI} file tries
to address an absurd position; for example, I contrived to make \TeX\ generate
a {\tt \BS hbox} that was 9000 points wide. If we do nothing about this,
\.{Crudetype} will probably crash with an arithmetic error, which is
unacceptable. So any character falling outside the limits |h_min..h_max| and
|v_min..v_max| will generate an error report.
@<Check the position@>=
if (Set_h < h_min) or ( Set_h > h_max )
then begin
warn('out of bounds position') ;
Set_h := h_min ;
{Chuck the character somewhere, hopefully out of the way}
end;
if (Set_v < v_min) or ( Set_v > v_max )
then begin
warn('out of bounds position') ;
Set_v := v_min ;
end;
@.error: out of bounds@>
@ @<Set init...@>=
h_max := h_resolution * 100 ;
v_max := v_resolution * 100 ;
h_min := -10 * h_resolution ;
v_min := -10 * v_resolution ;
@ Note that since the position fields of a |page_record| are subranges,
|h_max| etc. must be of the same type.
@<Glob...@>= h_max, v_max , h_min, v_min : i_word ;
@* Sorting the page.
Once we have assembled the complete page image, we must sort it. The method
used here is a merge sort based on the country dance called Grand March.
@<Sort the page@>=
@<The dancers form a long line up the middle of the hall and march
up towards the Presence@>
repeat
@<At the top they split, and alternate groups go to the left and right and
march down the sides@>
@<At the bottom of the hall, each group coming from the right hand side
merges with a group from the left side, and they go up again @>
until sorted;
@ Since the data being sorted is of unpredictable size and sequentially
processed, it logically ought to be a |file|. But this turned out to make the
program spectacularly slow. So I use linked lists instead--- a sacrifice of
logic to economy. But I continue to use file-like language.
@d send_one_set_to( #)==
copy_from( mid ) ( # )
@<At the top...@>=
L_reset( mid) ;
L_rewrite( left) ;
L_rewrite( right) ;
repeat
send_one_set_to( left) ;
if not L_eof( mid) then
send_one_set_to( right) ;
until L_eof( mid);
@ Eventually everybody comes together in one enormous set and the dance is
finished. The easiest way to detect this is to let it go round one more time.
Then the left side of the hall will be full and the right hand side empty.
@<At the bottom...@>=
L_rewrite( mid) ;
L_reset( left) ;
L_reset( right) ;
sorted := L_eof( right) ;
if sorted then
page_ptr := son( next( left))
else repeat
if L_eof( right) then copy_from( left) ( mid)
else if L_eof( left) then copy_from( right) ( mid)
else @<Merge one group from each side@>
until L_eof( left) and L_eof( right) ;
@ The natural way to assemble the page image is to throw everything into one
huge list, then start sorting. But the code for merging two simple lists was
horribly complicated. (The code given here merely merges two runs.) So the
page image is a list of lists (another sacrifice of logic to economy). Each
top-level entry has a |son|, which points to a sub-list. This is a sorted
subset (a ``run'') of the data. One advantage of the list-of-lists structure
is that we can take advantage of the fact that \TeX\ output is very ``runny''.
I found that this made \.{Crudetype} run at least 3 times faster than before.
@d Add_run == new_tail( mid_ptr) ; son( mid_ptr) := run_ptr ;
@<Merge one group...@>=
begin
L_rewrite( run) ;
L_run_ptr := son( left_ptr) ;
R_run_ptr := son( right_ptr) ;
repeat
if @<The person on the left is more eligible@>
then copy_from( L_run) ( run)
else copy_from( R_run) ( run) ;
until L_eof( R_run) and L_eof( L_run) ;
step_wipe( left_ptr) ;
step_wipe( right_ptr) ;
L_reset( run) ;
Add_run ;
end;
@ So while the page image is being assembled, it must be divided into runs.
@<Add the record...@>=
begin
if out_of_sequence then begin {create a new run}
L_reset( run) ;
Add_run ;
L_rewrite( run) ;
end;
new_tail( run_ptr ) ;
with image( run_ptr) do begin {write the data into it}
hpos := Set_h ; Old_h := Set_h ;
vpos := Set_v ; Old_v := Set_v ;
data := cod ;
end;
incr(page_size) ;
if page_size >= page_max then abort(
'overflowed page: either a bug, or recompile with larger page_max' ) ;
end
@.Fatal: overflowed page@>
@ Once the lists are all assembled, we must |reset| them before sorting.
@<The dancers...@>=
sorted := false;
L_reset( run) ;
Add_run ;
@ Now we must specify the desired order!! That is: increasing |vpos| and
|hpos|, |vpos| is more significant.
@d out_of_sequence ==
( ( Old_v > Set_v) or ( ( Old_v = Set_v) and ( Old_h > Set_h)))
@<The person on the left is more eligible@>=
( ( image( L_run_ptr).vpos < image( R_run_ptr).vpos) or
( ( image( L_run_ptr).vpos = image( R_run_ptr).vpos)
and ( image( L_run_ptr).hpos <= image( R_run_ptr).hpos)))
@ And here we get it all started. Since |garbage| wipes out everything in the
|pool| array above |zzz|, the following code effectively makes |mid..run|
permanent.
@<Set init...@>=
first_cell ;
make_new( mid );
make_new( left );
make_new( right );
make_new( run );
make_new( zzz );
image(zzz).vpos := max_half;
next(zzz) := zzz ;
mid_ptr := zzz ;
run_ptr := zzz ;
@ @<Set up an empty page image@>=
garbage ;
L_rewrite( mid) ;
L_rewrite( run) ;
page_size := 0 ;
Old_v := -max_half ;
@ @<Glob...@>=
zzz, cell, tempp, page_ptr,
mid, mid_ptr, run, run_ptr,
left, left_ptr, L_run_ptr ,
right, right_ptr, R_run_ptr : link;
page_size: page_i ;
Old_v, Old_h : i_word ;
sorted: boolean ;
declare_pool
@ Now we must define lots of machinery for handling lists. We could represent
a list by either a big array or dynamic storage. Neither is ideal, because an
array is bound to be either too big or too small; and some \.{PASCAL}s
apparently do not implement pointers. So I have expressed everything in terms
of certain macros, defined in the system dependent part of the program. In
theory, you can switch \.{Crudetype} from array to heap merely by redefining
these as follows:
\begintt
define image(#) == #^
define create == new(cell)
define first_cell ==
define link_type == ^page_record
define wipe_out(#) == dispose(#) ; { release data piecemeal}
define garbage ==
define declare_pool ==
\endtt
Both array and heap seem to work in VMS. I prefer to use an array because in
VMS, there seems to be no shortage of store, and an array is easier to debug.
Assuming these lowest-level macros, here is some machinery for handling lists.
We must deallocate cells after use. When using arrays, the |garbage| command
does it all in one go. Pointers must be |dispose|d one at a time, and the
obvious time is just after the data was used.
@d next(#) == image(#).prox
@d advance(#) == # := next(#)
@d make_new( #) == create; # := cell ;
@d new_tail( #) ==
create; next( #) := cell; # := cell ;
@d step_wipe( #) ==
tempp := # ; advance( #) ; wipe_out( tempp)
@ Suppose |L| is a list; then the actual variable |L| points to a permanently-
allocated cell which in turn points to the head of the list. |L_ptr| points to
the active end. After the list has been assembled, we first mark the tail, by
attaching a special element called |zzz|. Then we move the |L_ptr| round to
the head. |copy_from| must be used in the combination
{\tt copy\_from(A) ( B)}. It copies one element from the head of |A| to the
tail of |B|.
@d L_rewrite( #) ==
#@&ptr := # ; next( #) := zzz
@d L_reset( #) ==
next ( # @& ptr) := zzz ; #@&ptr := next( #)
@d L_eof( #) ==
( # @& ptr = zzz)
@d copy_end( #) ==
next( #@&ptr) := tempp ; advance( #@&ptr) ; end
@d copy_from( #)==
begin
tempp := #@&ptr ;
advance( #@&ptr ) ;
copy_end
@ Each top-level entry has the |false| type below; the |prox| field points to
the next top-level entry and the |down| field to a sub-list.
@d son(#) == image(#).down
@<Types...@>=
page_i = 0..page_max ;
link = link_type ;
page_record = packed record
prox: link ;
case boolean of
true: ( hpos : i_word;
vpos: i_word;
data: code_object ) ;
false: ( down : link) ;
end;
@* Processing a page of output.
The output of \.{Crudetype} is done by the procedure |Send_page|, which takes
the page and translates it for the printer. We shall process it a `line' at a
time, meaning all |page_records| with the same |vpos|. Initially |PR_font|
gets an impossible value so as to force an explicit |set_PR_font|.
@<Top level...@>=
procedure Send_page;
var line: link ;
begin
@<Pause reset@>;
PR_font := sentry;
PR_h := 0;
PR_v := 0;
repeat
line := read_line ;
do_line(line);
until L_eof( page) ;
end;
@ The function |read_line| runs along the page image until the vertical
position changes. It returns a pointer to a sublist which is the next line on
the page. As side effects, it moves the printer into position for this line,
advances |page_ptr| to the first record of the next line, and updates
|PR_v| and |PR_v_next|.
@<Medium...@>=
function read_line : link ;
var head, tail: link ; size: integer;
begin
head := page_ptr ;
size := 0 ;
PR_v_next := image(page_ptr).vpos ;
@<Move printer vertically to |PR_v_next|, update |PR_v| @>;
repeat
tail := page_ptr ;
advance(page_ptr) ;
PR_v_next := image(page_ptr).vpos ;
incr(size) ;
if size = max_line_size then
warn('excessively long line, probably this is a bug') ;
@.error: excessively long line@>
until ( ( L_eof( page) )
or (PR_v_next <> PR_v ) ) ;
next(tail) := zzz ;
read_line := head ;
end;
@ These variables all denote the printer fonts, etc.
@<Glob...@>=
PR_v, PR_v_next,
PR_h, PR_h_next,
PR_font : i_word ;
@ This procedure tries to print a line. The main difficulties are: we dont
want to |Backfeed| unless absolutely necessary; and we may have to deal with
overstruck characters. One possible way is to shunt them aside somewhere, then
print the |overflow| after the main line has been printed.
@<Medium...@>=
procedure do_line (line_ptr: link);
var overflow : link;
begin
overflow := zzz ;
while line_ptr<>zzz do
@<Process the character that |line_ptr| points to, and |advance| to
the next@>;
@<End the line, trying very hard not to over-feed the paper and print
the |overflow|@>;
@<Check pause@>;
end;
@ We are actually getting almost in sight of the printer!!! Before we can
actually print a character, we must first check if it has to go to the
|overflow|...
@<Process the char...@>=
with image(line_ptr) do begin
PR_h_next := hpos ;
if not b_space_absolute and not b_space_by_string and
(PR_h_next < PR_h) then
begin
{AKT: ignore overflow stuff
next(overflow) := line_ptr ;
advance(overflow) ;
AKT}
advance(line_ptr) ;
end
else begin
@<Set horizontal position for the next character@>;
if data.IM_font <> PR_font then set_PR_font(data.IM_font);
print(zchr(data.IM_char )) ;
PR_h := PR_h + data.breadth ;
step_wipe(line_ptr ) ;
end ;
end;
@* Downloading. Not started yet.
@<Download a whole font@>= do_nothing
@ @<Enter a download...@>=
@* Carriage control.
Once the superior software has decided where the printer has to move to next,
this section has the job of translating the desired position into elementary
printer commands. Clearly this mapping depends very much on the range of
functions that the printer can perform. So this section is controlled by
several boolean constants; each asserts that the printer can do the
corresponding action. Here is a list of the most important ones:\item
|@!c_r_feed_dist| is the distance in |v_steps| by which a carriage-return
feeds the paper.\item
|@!w_l_feed_dist| ditto, |write_ln|. Similarly for the other |dist|
values.\item
|@!feed_absolute| says the printer has an absolute position command that takes
a parameter |IM_y|, say, and moves to position |IM_y v_steps| down the
page.\item
|@!b_feed_absolute| ditto, backfeeding.\item
|@!b_feed_by_string| says the printer has a |Backfeed| character that moves it
back by a fixed number |b_feed_dist| of |v_steps|. These booleans should not
be set true unless the printer can backfeed reliably.\item
|@!space_absolute| etc., Ditto, horizontal moves.\item
|@!abs_is_incr| says that in the absolute position commands, the parameter is
actually an incremental move.\item
|@!w_l_does_c_r| says that |write_ln| forces a carriage return.
As mentioned above, it is essential to avoid premature line feeds as much as
possible. Also, many operating systems will choke if the output record gets
too long, so we must do a |print_ln| at intervals. This program tries to
accommodate various types of carriage control, some of which are not in use at
the author's site. This means that several pieces of code have not been
tested. Installers may find that the procedures defined here will need to be
carefully studied in conjunction with the I/O section of their \PASCAL\
manual.
@<Const...@>= @<Carriage control constants@>
@ Now consider what happens at the end of each line. We will want to do a
subset of the following things: carriage-return, print the |overflow|, line
feed, split output records. We must keep a clear separation between these
tasks, and we want to do them in the stated order (but we cannot if
|fortran|). This order puts most of the carriage controls to the ends of the
output records, and (on our machine) makes it easier to examine the output
file with an editor. So first: do we want to do carriage-return? If so, then
the natural way is to print a carriage-return, but not if it will over-feed
the paper.
@<End the line...@>=
if not w_l_does_c_r {Return is compulsory}
or (c_r_feed_dist = 0) {Return is harmless }
or b_space_absolute
or ((not want_split or (overflow <> zzz ) ){We can choose C-R or W-L}
and (c_r_feed_dist < w_l_feed_dist))
then begin
if b_space_absolute and ((c_r_feed_dist > 0) or (l_margin > 0)) then
set_h_abs(0)
else begin
if fortran then print_ln ;
if not list then {AKT: only send CR if not list}
print(c_r_char);
PR_h := 0;
PR_v := PR_v + c_r_feed_dist ;
end;
@ Now for the |overflow|. We will split records if that is harmless.
@<End the line...@>=
if overflow <> zzz then begin
if not fortran and (w_l_feed_dist = 0) then print_ln ;
{AKT: must do this if list???}
next(overflow) := zzz ;
overflow := next(zzz) ;
do_line(overflow);
end;
@<Reset printer at end of line, if necessary @>
@ @<Reset printer...@>= {hook}
@ Now we decide whether to do any |line_feed|s. But first, we may have to
attempt to |Backfeed|. Sometimes the program will fail; it should not do so
unless the \.{DVI} file calls for overstruck characters and the printer
genuinely cannot do them. If |b_feed_scream|, then print an error message.
@<Move printer...@>=
if want_split then PR_v_next := PR_v_next - w_l_feed_dist ;
if (PR_v_next < PR_v) then begin
if b_feed_absolute then set_v_abs(PR_v_next)
else if b_feed_by_string then
while PR_v_next < PR_v do @<Backfeed@>
else if b_feed_scream then begin
warn('this printer cant feed backwards');
@.error: printer cant...@>
display_ln('approximate vertical position is: ', PR_v_next);
display_ln(' printing over-fed line on line below');
display_ln(' ');
PR_v := PR_v_next;
end;
end;
@ If we avoided over-feeding, we may want to feed forwards.
@<Move printer...@>=
if PR_v_next > PR_v then begin
if feed_absolute then set_v_abs(PR_v_next)
else begin
while PR_v_next >= PR_v + feed_dist do @<Line feed@>;
while PR_v_next > PR_v do @<Tiny feed@> ;
end;
end;
if want_split then begin
PR_v := PR_v + w_l_feed_dist ;
PR_v_next := PR_v_next + w_l_feed_dist ;
if not list then print_ln; {AKT: was just print_ln;}
if w_l_does_c_r then PR_h := 0 ;
end;
@ We set the horizontal position in a similar way, but we do not need to be so
paranoid about backspacing as about back-feeding.
@<Set horiz...@>=
if PR_h_next = PR_h then
else begin
if (PR_h_next < PR_h) then begin
if b_space_absolute then set_h_abs(PR_h_next)
else if b_space_by_string then
while PR_h_next < PR_h do @<Backspace@>;
end;
if space_absolute and (PR_h_next > PR_h )
then set_h_abs(PR_h_next)
else begin
while PR_h_next >= PR_h + space_dist do @<Space@>;
while PR_h_next > PR_h do @<Tiny space@> ;
end;
end;
@* Low level modules for printer control.
Now we have to translate these elementary printer commands into actual strings
of characters to be put into |printfile|. Here is the command for setting a
new printer's font.
@<Lowest...@>=
procedure set_PR_font(new:integer) ;
begin
if (new = PR_font) or only_one_font then
else
begin
string_print(font_prefix) ;
print_integer(new, param_type);
string_print(font_suffix) ;
PR_font := new ;
end;
end;
@ Now for |absolute| movements, if the printer can do them. The procedure
|set_v_abs| moves the printer to position |mm h_steps| below the top of the
paper. If |abs_is_incr| then the printers `absolute' command is actually an
incremental command. So the parameter sent to the printer must be decreased by
|PR_v|.
@<Forward...@>=
procedure set_v_abs(mm: integer) ; forward ;
procedure set_h_abs(mm: integer) ; forward ;
@ @<Lowest...@>=
procedure set_v_abs;
var new_pos :integer ;
begin
if abs_is_incr then
new_pos := mm - PR_v
else new_pos := mm ;
string_print (v_abs_prefix) ;
print_integer(new_pos, param_type) ;
string_print (v_abs_suffix) ;
PR_v := mm ;
end;
procedure set_h_abs;
var new_pos :integer ;
begin
if abs_is_incr then
new_pos := mm - PR_h
else new_pos := mm ;
string_print (h_abs_prefix) ;
print_integer(new_pos, param_type) ;
string_print (h_abs_suffix) ;
PR_h := mm ;
end;
@ Now consider commands for printers that can only do simple movements. A
|tiny| movement is usually a movement of one |h_step| or |v_step|. All these
modules should be protected, so they cannot be called unless the printer can
actually do the stated movement. Normally, the command strings for these are
only simple characters, so we can just |print| them.
@<Formfeed@>=
{AKT: was page(printfile);}
print_ln;
print_ln;
print_ln;
print('-------------------- [new page] --------------------');
print_ln;
print_ln;
if is_header then
string_print (page_top);
@ @<Backfeed@>=
begin string_print(b_feed_string);
PR_v:=PR_v - b_feed_dist;
end
@ @<Line feed@>=
begin
if fortran then begin
print_ln ;
PR_v := PR_v + w_l_feed_dist;
end;
if list then {AKT: was just print(feed_char);}
print_ln
else
print(feed_char);
PR_v:=PR_v+feed_dist;
end;
@ @<Tiny feed@>=
begin print(t_feed_char); PR_v:=PR_v+t_feed_dist; end;
@ @<Backspace@>=
begin print (b_space_char); PR_h:=PR_h-b_space_dist; end;
@ @<Space@>=
begin print (space_char); PR_h:=PR_h+space_dist; end;
@ @<Tiny space@>=
begin print (t_space_char); PR_h:=PR_h+t_space_dist; end;
@* Default declarations for printer.
Here we define a lot of printer-dependent material that is expected to be the
same for most printers. Of course, these will have to be changed if |fortran|,
or on a system that does not use ASCII codes. First, some command characters
for simple movements. |feed| means a vertical movement and |space| horizontal.
Each |thing_char| is the character needed to make the printer do the named
action. Owing to the rules of \.{TANGLE}, the words |back| and |tiny| have to
be abbreviated (to avoid identifier clashes). |c_r_char| etc. must be
consistent with the value of |fortran|.
@.ASCII@>
@<Set init...@>=
space_char := chr(32) ;
t_space_char := chr(32) ;
feed_char := chr(10) ;
t_feed_char := chr(10);
c_r_char := chr(13);
b_space_char := chr(8);
@ @<Glob...@>=
space_char,
t_space_char,
feed_char ,
t_feed_char ,
c_r_char ,
b_space_char : char ;
@ Next the distances that they normally move, always in |steps|.
@<Carriage control const...@>=
space_dist = 1;
b_space_dist = 1;
t_space_dist = 1;
feed_dist = 1 ;
w_l_feed_dist = 0 ;
b_feed_dist = 0;
t_feed_dist = 1;
c_r_feed_dist = 0 ;
tiny_drop = 500000 ; {AKT: avoid E in TEX dropping to next line}
{tiny_drop = 50000 ; slightly less than a point}
big_drop = 4 ;
@ |start_stuff| and |stop_stuff| get written into the start and end of
|printfile|. They are intended to: set printer into correct state for \TeX\
output, and reset printer to standard state afterwards. If the printer needs
to be re-initialised in any way at the top of each page, then set |@!page_top|
to the necessary data and set |is_header| to |true|.
@<Open |printfile|@>=
string_print(start_stuff) ;
print_ln ;
@ @<Clean up afterwards@>=
string_print(stop_stuff);
@ @<Glob...@>=
start_stuff,
stop_stuff,
page_top,
b_feed_string ,
font_prefix,
font_suffix,
v_abs_prefix,
v_abs_suffix,
h_abs_prefix,
h_abs_suffix : var_string ;
print_end : var_string ;
@* Printer dependent data.
This section should define masses of data to describe how the printer behaves.
In order to keep the size of each printer's change file within reasonable
bounds, I have replaced this section by a blank. The missing data is given in
the line printer change file. To set up for another printer, that file will
have to be extensively edited.
*** Attach printer change file here ***
@* Index.
Pointers to error messages appear here together with the section numbers
where each identifier is used.